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ABSTRACT 


Prokaryotic type Il adaptive immune systems have 
been developed into the versatile CRISPR 
technology, which has been widely applied in site- 
specific genome editing and has revolutionized 
biomedical research due to its superior efficiency 
and flexibility. Recent studies have greatly diversified 
CRISPR technologies by coupling it with various 
DNA repair mechanisms and targeting strategies. 
These new advances have significantly expanded 
the generation of genetically modified animal 
models, either by including species in which targeted 
genetic modification could not be achieved 
previously, or through introducing complex genetic 
modifications that take multiple steps and cost years 
to achieve using traditional methods. Herein, we 
review the recent developments and applications of 
CRISPR-based technology in generating various 
animal models, and discuss the everlasting impact of 
this new progress on biomedical research. 


Keywords: CRISPR/Cas9; Genome editing; Animal 
models 


INTRODUCTION 


Genome editing by manipulating functional DNA sequences in 
the host genome is a fundamental strategy for biomedical 
research. Starting from the discovery of the basic principles of 
DNA structure and genome organization, scientists have 
investigated various strategies for many decades to improve 
genome editing technology for different research and 
application purposes. 

In the 1980s, gene targeting methods emerged together with 
a deepening understanding of DNA repair mechanisms. Back 
then, DNA conversion was found to occur between homology 
sequences, often termed homologous recombination (HR) (Zinn 
& Butow, 1985). Early studies took advantage of this finding to 
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replace a selected endogenous genome DNA segment with a 
foreign DNA donor carrying homology sequences in living cells 
(Vasquez et al., 2001). Subsequently, by combining this with 
mouse embryonic stem cell (ESC) technology established at 
the same time, traditional gene targeting technology was 
developed to generate genetically modified mice (Koller et al., 
1989). Since 1989, genetic modification by HR-based gene 
targeting in living mammals has become a fundamental 
approach to analyze gene functions and has revolutionized our 
understanding of mammalian development, metabolism, and 
genetic diseases (Capecchi, 2005; Koller et al., 1989). 

Traditional HR-based gene targeting is associated with low 
efficiency and requires laborious clonal expansions and 
sophisticated selections to identify target cells carrying the 
desired modifications (Koller et al., 1989). With pioneering studies 
finding that the introduction of double-strand breaks (DSBs) in 
target DNA by rare-cutting endonuclease l-Sce-1 could increase 
HR efficiency by several orders of magnitude in the subsequent 
DNA repair process (Rouet et al., 1994), extensive effort has 
been made to develop programmable endonucleases. 

Zinc finger nuclease (ZFN), which was first reported in 1986 as 
an artificial nuclease to carry a zinc finger domain and a catalytic 
domain from restriction enzyme Fokl, was suitable for introducing 
DNA cleavage and enhancing HR-dependent gene targeting 
(Bibikova et al., 2002; Kim et al., 1996). However, the laborious 
work involved in the design and identification of an efficient ZEN 
to a newly selected target sequence significantly limited its utility. 
Transcription activator-like effector protein (TALE), which 
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originated in plant pathogen Xanthomonas sp., was found to 
recognize target DNA with highly conserved yet variable repetitive 
elements, each showing a preference to bind to specific 
nucleotides (Boch et al., 2009; Moscou & Bogdanove, 2009). 
Fusion of the programmable TALE domains and Fokl catalytic 
domain thus yielded TALE-nuclease (TALEN), which is easier to 
construct and can introduce DNA cleavage and targeted genome 
modification equally efficiently as ZFN (Christian et al., 2010). 

More recently, an RNA-guided DNA-targeting approach was 
developed from the type ll prokaryotic clustered regularly 
interspaced short palindromic repeats (CRISPR) adaptive 
immune system (Bhaya et al., 2011; Wiedenheft et al., 2012). In 
this system, a programmable small guide RNA (sgRNA) 
complexes with Cas9 nuclease and anneals with a 20-nt target 
DNA sequence, at the presence of the adjacent NGG PAM 
(proto-spacer adjacent motif) sequence in a base-pairing 
manner. This process allows Cas9 to introduce DSB at the 
target region and enables genome modification in a site-specific 
manner (Jinek et al, 2012). The ease of constructing a 
sequence-specific sgRNA and the highly specific RNA-DNA 
recognition has made the CRISPR/Cas9 system superior to 
ZFN and TALEN, becoming the most popular tool for 
introducing programmed DNA cleavage as well as site-specific 
genome modifications in cells and animals (Barrangou A 
Doudna, 2016; Mali et al., 2013; Ran et al., 2013). 

These recent advances in engineered nucleases, especially 
the CRISPR/Cas system, have opened new prospects for 
accomplishing robust gene targeting in previously non-permissive 
cell contexts. More importantly, it has widely revolutionized 
biomedical research by promoting quick generation of various 
animal models, which either carry complex genome modifications 
or are derived from species that could not be genetically modified 
previously (Dow et al., 2015; Swiech et al., 2015; Yin et al., 2016). 
Such progress has provided a wide range of methods as well as 
advanced animal models to study gene function and biological 
processes, significantly promoting research under in vivo 
conditions. Hence, in this review, we focus on summarizing the 
recent developments and applications of CRISPR-based 
technology in generating various animal models. 


OVERVIEW OF RECENT DEVELOPMENTS IN CRISPR- 
BASED ANIMAL MODELS 


Since early 2013, when the first successful CRISPR-based 
genome editing was demonstrated in mammalian cells (Mali et 
al., 2013), the number of studies using the CRISPR system has 
grown dramatically. Among the CRISPR-based in vivo studies, 
the majority (61.2%) have been conducted using mouse models 
(Figure 1, left panel). With the comprehensive knowledge and 
technologies established so far, research investigations using 
CRISPR technology in mouse models have covered various 
areas of biomedical research, including inherited metabolic 
disorders (Xue et al., 2014; Yang et al., 2016), cancer (Maddalo 
et al., 2014; Platt et al., 2014), neurology and neuroscience (Li 
et al., 2015c; Swiech et al., 2015), and virus-related studies 
(Jiang et al., 2017; Zhu et al., 2016). 

In addition to mouse models, CRISPR-based genome editing 


has been demonstrated in large mammals such as pigs and 
monkeys to establish disease or genetic models for organ 
transplantation (Niu et al., 2014; Yu et al., 2016). At the same time, 
CRISPR/Cas9 technology has also been applied in various lower 
vertebrate and invertebrate models (Irion et al., 2014; Shi et al., 
2015; Wen et al., 2016). The success of CRISPR technology is 
particularly valuable in lower vertebrate models, such as Xenopus 
and zebrafish (lrion et al., 2014; Shi et al., 2015), in which 
targeted genome editing could not be achieved previously. 


Molecular mechanisms for various genome editing strategies 

Sequence-specific DNA cleavage induced by any of the above 
engineered nucleases will elicit endogenous cellular responses 
to repair the damaged DNA in target cells. Utilizing various DNA 
repair mechanisms to induce mutations/deletions or to incorporate 
insertions of foreign DNA lays the foundation for genome 
editing. Cellular repair of DNA damage is mediated by two main 
pathways, namely, homology-directed repair (HDR) and non- 
homologous end joining (NHEJ). Despite their varied activities 
in different cell types and species, both pathways are highly 
conserved, from yeasts to mammals (Taylor & Lehmann, 1998). 

The HDR pathway mediates a strand-exchange process to 
repair DNA damage based on existing homologous DNA 
sequences (Heyer et al., 2010), allowing precise insertion of 
foreign DNA at target regions by replacing endogenous 
genomic segments with donor DNA. CRISPR/Cas9-introduced 
site-specific DNA cleavage triggers DNA repair and greatly 
promotes HR at nearby regions, thus enhancing the efficiency 
of HDR-based genome editing (Yang et al., 2013). In contrast, 
the conventional NHEJ pathway initiates DNA repair with quick 
occupation by the Ku70/Ku80 complex at DNA broken ends, 
followed by recruitment of other components for end processing 
and subsequently DNA ligase IV for ligation. NHEJ-based DNA 
repair is a homology-independent and mechanistically flexible 
process, which often results in random insertions or deletions 
(indels) of a small number of nucleotides (Lieber, 2010). Hence, 
CRISPR/Cas9-induced NHEJ repair has been employed to 
generate loss-of-function alleles in protein-coding genes (Wang 
et al., 2013). In general, the NHEJ pathway mediates rapid 
DNA repair and plays an important role in various cellular 
contexts. Therefore, CRISPR/Cas9-induced NHEJ repair offers 
high efficiency and has been exploited to develop a variety of 
targeting strategies. 

More recently, in addition to the conventional HDR and NHEJ 
pathways, studies have discovered the microhomology-mediated 
end joining (MMEJ) pathway, which is also termed as alternative 
NHEJ (AIt-NHEJ) pathway (Lieber, 2010; McVey & Lee, 2008). 
This MMEJ pathway repairs DNA damage by initiating single- 
strand resection similar to the HDR process, followed by 
microhomology-based alignment and ligation by DNA ligase III. In 
general, the MMEJ pathway mediates an error-prone repair 
process and plays a minor role to complement DNA repair by 
HDR and NHEJ (McVey & Lee, 2008). Collectively, the coupling 
of different DNA repair mechanisms with various strategies to 
design donor templates or select target sites in genomes, has 
resulted in a variety of targeting approaches, each having distinct 
advantages in different species (Table 1). 
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Table 1 Summary of CRISPR-based in vivo genome editing in zygotes via different DNA repair mechanisms in different species 








Genome modifications and Species ESCs Efficiency** References 
targeting strategies involvement 
NHEJ-based knockout by Mouse: 
introducing indels Multiple genes No 50%-80% Dow et al., 2015; Mandasari et al., 2016; Swiech et al., 
2015; Wang et al., 2013; Xu et al., 2017 
Single gene 8%-90% Challa et al., 2016; Hay et al., 2017; Helsley et al., 2016; 
Hinze et al., 2017; Ishikawa-Fujiwara et al., 2017; Jiang et 
al., 2017; Kasparek et al., 2016; Kim et al., 2017b; Li et 
al., 2015d; Mandasari et al., 2016; Meyer et al., 2016; 
Mianné et al., 2017; Miyata et al., 2016; Sweeney et al., 
2017; Wang et al., 2015a; Zhang et al., 2017b; Zhong et 
al., 2015; Zhu et al., 2016 
Rat: 
Single gene No 28.696—45.596 Rannals et al., 2016; Wang et al., 2016c; Yoshimi et al., 2014 
Pig: 
Multiple genes No N/A Wang et al., 2016a 
Single gene 60%-83%% for embryos Park et al., 2017; Petersen et al., 2016; Wang et al., 
2015e; Yu et al., 2016 
50%—100% for offspring 
Monkey: 
Single gene No 10.2% for offspring Niu et al., 2014 
Sheep: 
Single gene No 37.4%-87.6% for embryos Crispo et al., 2015; Li et al., 2017b; Niu et al., 2017a; 
Zhang et al., 2017a 
59.1%-83.3% for offspring 
Goat: 
Single gene No 15%-28.6% Malpotra et al., 2017; Wang et al., 2015c; Zhou et al., 2017 
Zebrafish: 
Single gene No ~32.9% Ablain et al., 2015; Anelli et al., 2017; Fujii et al., 2016; 
Gallardo et al., 2015; Gui et al., 2017; He et al., 2015; 
Homma et al., 2017; Hoodless et al., 2016; Lee et al., 
2016; Narayanan et al., 2016; Perles et al., 2015; Shah et 
al., 2015; Varshney et al., 2015; Vejnar et al., 2016; Wang 
et al., 2014; Yuan et al., 2016a; Zhang et al., 2014 
Drosophila: 
Multiple genes No N/A Port et al., 2014 
Single gene Bassett et al., 2014; Gao et al., 2015; Wakabayashi et al., 2016 
Rabbit: 
Single gene No 98.7% for embryos Lv et al., 2016; Yuan et al., 2016b 
100% for offspring 
Mosauito: No N/A Dong et al., 2015 
NHEJ-based knockout via Mouse Yes 10%-90% Han et al., 2014; Kraft et al., 2015; Seruggia et al., 2015; 
deletion Wang et al., 2015a, 2017; Zhang et al., 2016 
NHEJ-based knock-in Zebrafish No 496—5496 Auer et al., 2014; Hisano et al., 2015; Kimura et al., 2014; 
Li et al., 2015a 
Frog No 896—1296 Shi et al., 2015 
Sheep No 34.7% Ma et al., 2017 
HDR-based knockout Drosophila No 47% Wen et al., 2016 
HDR-based knock-in Mouse: 
dsDNA No 10%-88% Aida et al., 2015; Chu et al., 2016; Guo et al., 2016; Han 
et al., 2015; Ishizu et al., 2016; Lewis et al., 2016; Li et al., 
2016; Mashiko et al., 2014; Wang et al., 2015b; Wu et al., 
2013 
ssODN 6926—669?6 Inui et al., 2014; Zhu et al., 2017 
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Continued 





Genome modifications and Species ESCs Efficiency** References 
targeting strategies involvement 
HDR-based knock-in Rat: 
ssODN No 7.7% Yoshimi et al., 2014 
Pig: 
dsDNA No 596—18.296 Peng et al., 2015 
ssODN 28.6%-80% Zhou et al., 2016 
Goat: 
ssODN No 24% Niu et al., 2017b 
Zebrafish 
dsDNA No 1.7%-3.5% Irion et al., 2014 
C. elegans: 
ssODN No 4.9% 62.8% Paix et al., 2016 


Drosophila No 


Frog No N/A 
MMEJ-based knock-in Zebrafish No N/A 
Mouse No 12% 


4.3%—-10.8% 


Li et al., 2015e; Lin & Potter, 2016; Liu et al., 2016; Ukken 
et al., 2016; Voutev & Mann, 2017; Yu et al., 2014 
Sakuma et al., 2016 


He et al., 2015 
Aida et al., 2016 


*: Data were converted into percentages, without normalization or additional statistical analysis. 
*: Data presented were based on records at the offspring stage, if stage not indicated. 


Enhanced genome editing via CRISPR-induced HDR 

HDR is a major DNA repair mechanism broadly employed in 
CRISPR-based genome editing (Heyer et al., 2010). In the 
presence of Cas9 nuclease and specific sgRNA targeting a 
selected sequence in the genome, site-specific DNA cleavage 
is introduced at the target genomic locus, which then will trigger 
DNA repair. When the target cells are given a large quantity of 
donor templates carrying homology sequences, HDR-based 
repair will utilize the donors as templates to repair the damaged 
genome, thus introducing foreign DNA included in the donor 
construct into the recipient genome (Heyer et al., 2010). 

The traditional gene targeting approach succeeded before 
the establishment of engineered nucleases. To accomplish 
sequence replacement in the genome, this approach relies on 
the HDR repair process triggered by spontaneous DNA damage 
that randomly occurs near target regions, (Koller et al., 1989). 
The desired targeting events occur at low frequency. Hence, 
successful genome targeting requires long homology arms in 
donor constructs, and needs sophisticated selection and clonal 
expansion in mouse ESCs before generating chimeric animals 
and genetically modified offspring (Koller et al., 1989; Thomas & 
Capecchi, 1987). It often takes more than one year to establish 
a knock-in or knockout strain of mouse. 

Site-specific DNA breaks trigger DNA repair around a target 
region. Hence, coupling this to the CRISPR system can greatly 
enhance the efficiency of HDR-based genome targeting and 
result in a high success rate of desired targeting. This 
improvement has bypassed the usage of ESC cells, allowing 
direct genome targeting in mouse zygotes (Yang et al., 2013). 
The direct genome targeting in zygotes via CRISPR-coupled 
HDR can produce a high percentage of chimeric animals and 
genetically modified mouse strains within 3-6 months, a much 
shortened period of time (Yang et al., 2013). Moreover, direct 


genome targeting in zygotes has also overcome the limitations 
of ESC unavailability, and made genome editing possible in 
many previously inaccessible organisms, such as pigs and 
monkeys (Peng et al., 2015). Furthermore, the introduction of 
site-specific DNA breaks allows the use of much shorter 
homology arms to achieve successful genetic modifications. 
Around 1000 bp homology fragments are usually sufficient, and 
around 100 bp single-stranded oligodeoxynucleotides (ssODN) 
carrying a 50-60 nt homology sequence at each side are 
effective in introducing small mutations/insertions to produce 
genetically modified animals (Inui et al., 2014; Zhou et al., 
2016). 

CRISPR-coupled HDR-mediated in vivo genome editing has 
been broadly used to introduce knock-in or knockout in the 
genome of various animal models for studying gene functions, 
modeling diseases, or developing novel treatment by correcting 
disease-associated mutations. Direct injection of Cas9 mRNA, 
sgRNA targeting only the mutant allele, and donor ssODN 
carrying a wild-type allele sequence into mouse zygotes 
carrying a heterozygous dominant-negative cataract-causing 
mutation in the Crygc gene resulted in cataract-free progeny 
(Wu et al., 2013). Besides rodents, large animals like pigs have 
also been used for disease modeling (Peng et al., 2015; Wang 
et al., 2015d; Zhou et al., 2016). In these studies, together with 
the use of the single blastocyst genotyping system and/or 
ssODN donors, researchers can assess sgRNA efficiency at 
the embryonic stage and achieve up to 80% targeting efficiency 
in producing animals carrying the desired genetic modification. 
Furthermore, successful targeting has also been reported in 
lower vertebrates and invertebrates (Irion et al., 2014; Li et al., 
2015e; Lin & Potter, 2016; Liu et al., 2016; Paix et al., 2016; 
Sakuma et al., 2016; Ukken et al., 2016; Voutev & Mann, 2017; 
Yu et al., 2014). Targeted gene modification and tagging has 
been achieved in Drosophila based on the CRISPR/Cas9- 
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coupled HDR approach (Li et al., 2015e; Lin & Potter, 2016; Liu 
et al., 2016; Ukken et al., 2016; Voutev & Mann, 2017; Yu et al., 
2014), with a similar method also applied in zebrafish, 
producing up to 50% targeted mutations in larvae (Irion et al., 
2014). With modified ssODN templates and CRISPR 
components, gene editing efficiency has reached 85% in C. 
elegans (Paix et al., 2016). Targeted genes or long noncoding 
RNA (IncRNA) can be precisely replaced with fluorescence 
reporters to deplete target genes by inserting visible markers 
(Platt et al., 2014; Wen et al., 2016). 


Gene correction in somatic tissues has also been performed 
using the CRISPR system and donor DNA (Table 2). Targeting 
of deficient ornithine transcarbamylase in the mouse model 
showed more than 10% correction of the deficient gene in liver 
cells and significantly improved the survival rate in target groups 
(Yang et al., 2016). Similarly, somatic correction of Duchenne 
muscular dystrophy (DMD) caused by a mutation in the gene 
encoding dystrophin has been reported, showing a 70% 
increase in functional dystrophin and apparent improvement in 
the mouse model (Bengtsson et al., 2017). 


Table 2 Summary of CRISPR-based in vivo genome editing in somatic tissues 





Genome modifications and Species Delivery system Efficiency** References 
targeting strategies 
NHEJ-based knockout via Mouse Virus 14.8%-86% Cheng et al., 2014; Chiou et al., 2015; de Solis et al., 2016; 
indel formation Ding et al., 2014; El Fatimy et al., 2017; Guo et al., 2017; 
Heckl et al., 2014; Hung et al., 2016; Kaminski et al., 2016; 
Kim et al., 2017a; Li et al., 2017a; Monteys et al., 2017; 
Ortinski et al., 2017; Tabebordbar et al., 2016; Wang et al., 
2015a, 2016b; Yin et al., 2017 
Hydrodynamic injection N/A Liang et al., 2017; Weber et al., 2015; Xue et al., 2014 
Electroporation N/A Kalebic et al., 2016; Latella et al., 2016; Maresch et al., 
2016; Shinmyo et al., 2016; Straub et al., 2014 
Cell injection 90% Courtney et al., 2016; Katigbak et al., 2016; Wu et al., 2017 
Chicken Electroporation N/A Véron et al., 2015 
NHEJ-based knockout via Mouse Virus N/A Long et al., 2016; Nelson et al., 2016 
deletion Hydrodynamic injection 30% Pankowicz et al., 2016 
HDR-based knockout Mouse Virus 85% Platt et al., 2014 
HDR-based knock-in Mouse Virus 2.396—696 Bengtsson et al., 2017; Xie et al., 2016; Yang et al., 2016; 
Yin et al., 2016 
Electroporation Chen et al., 2016 
Cell injection Ou et al., 2016 
MME Lbased knock-in Mouse Virus 20% Yao et al., 2017 
NHEJ-based knock-in Mouse Virus 3.4%-10% Suzuki et al., 2016 
Chromosomal rearrangement Mouse Virus N/A Blasco et al., 2014; Maddalo et al., 2014 


*: Data were converted into percentages, without normalization or additional statistical analysis. 
*: Data presented were based on records at somatic tissue level, if stage not indicated. 


Diverse targeting strategies through CRISPR-induced 
NHEJ-mediated DNA repair 

Double-strand DNA breaks due to the disruption of 
phosphodiester bonds between adjacent nucleotides in double- 
helix DNA. While HDR repairs a broad range of DNA damage, 
NHEJ is the primary mechanism for repairing DSBs in 
mammalian cells. With site-specific DSBs able to be introduced 
at almost any target site in the genome with high efficiency and 
accuracy using the CRISPR system, the NHEJ repair 
mechanism has been broadly employed to introduce random 
mutations at selected target sites. This CRISPR-coupled NHEJ- 
based mutagenesis approach can disrupt protein coding 
potential of a target gene by causing frame shift or premature 
termination, and therefore deplete functional proteins and 
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introduce loss-of-function effects (Figure 1). To date, most 
animal models established using CRISPR technology have 
employed this strategy to knockout a specific gene, especially 
model organisms that are incompatible with the traditional HDR- 
based strategy, such as zebrafish or Xenopus (broadly noticed 
via personal communications) (Table 1 and 2) (Auer & Del 
Bene, 2014; Irion et al, 2014; Won & Dawid, 2017). 
Furthermore, due to its simple principles and procedures, 
CRISPR-NHEJ-based mutagenesis has been applied in high- 
throughput studies. Xu et al. reported successful loss-of- 
function screening to identify genes essential to tumorigenesis 
in mice using pre-constructed sgRNA libraries (Xu et al., 2017). 
Interestingly, in vivo application of a sgRNA library has also 
been reported in zebrafish (Shah et al., 2015). Combining 
CRISPR-based high-throughput screening with excellent 


accessibility to embryonic development, ` straight-forward 
phenotyping has allowed large scale analysis of gene function. 
Shawn M. Burgess and colleagues have verified more than 50 
genes by this method (Varshney et al., 2015), and Stefania 
Nicolia’s team has succeeded in a similar screening using the 
SgRNA pool-targeting miRNA family (Narayanan et al., 2016). 

In addition, NHEJ repair has been found to be highly efficient 
in re-ligating DNA ends from DSBs concurrently produced by 
the CRISPR system at two different genome loci, despite the 
long distance in genome. In support of these observations, the 
CRISPR-coupled NHEJ repair mechanism has also been 
employed to delete selected large DNA fragments by targeting 
two regions in the same chromosome (Dow et al., 2015; Han et 
al., 2014; Wang et al., 2015b) or catalyzing the desired genomic 
rearrangements by targeting two selected regions from different 
chromosomes (Blasco et al., 2014). These strategies have 
succeeded in generating mouse models carrying a 353-kb 
intragenic deletion of Laf4, which recapitulates a human 
malformation syndrome (Kraft et al., 2015), and engineering 
mouse models that harbor chromosomal rearrangements 
recurrently found in lung cancer to model carcinogenesis 
(Blasco et al., 2014; Maddalo et al., 2014). The functional study 
of IncRNA genes is another important application of NHEJ- 
mediated large fragment deletion. Knockout of the IncRNA gene 
Rian through a large deletion of up to 23 kb demonstrated 
efficiency as high as 33% (Han et al., 2014) can be achieved, 
with similar results reported for the tyrosinase (Tyr) associated 
IncRNA gene (Seruggia et al., 2015). 

Rather strikingly, CRISPR-coupled NHEJ repair has also 
enabled high-efficiency knock-in of exogenous DNA at pre- 
selected locations. This is consistent with common observations 
that NHEJ is the predominant repair mechanism in mammalian 
cells. Since the early 1980s, transgenic technology has been 
established and applied broadly to render stable ectopic 
expression by introducing foreign DNA fragments carrying 
complete gene cassettes into host genomes (Palmiter et al., 
1982). Later studies have found that the NHEJ repair 
mechanism is responsible for capturing foreign DNA fragments 
at spontaneously occurring DSBs in the genome, resulting in 
random integrations (Lin & Waldman, 2001). Consistently, 
traditional gene targeting studies have also shown that the 
frequency of random DNA integration via the NHEJ repair 
mechanism is significantly higher (over 1 000-fold) than targeted 
insertion mediated by the HDR pathway (Vasquez et al., 2001). 
Due to the unavailability of programmable site-specific 
nucleases and their erroneous nature, the potential of the NHEJ 
mechanism in targeted DNA knock-in was largely neglected for 
a long time. 

Until recently, after ZFN was successfully established, short 
oligonucleotides (<100 bp) were able to be inserted efficiently at 
ZFN-induced DSBs via NHEJ repair (Orlando et al., 2010). 
Subsequently, inclusion of a ZFN or TALEN target sequence in 
donor vectors showed that simultaneous cleavage of donor and 
genome DNA could enable targeted integration via NHEJ repair 
(Cristea et al., 2013; Maresca et al., 2013). Using promoterless 
fluorescence reporters followed by direct quantification using 
fluorescence-activated cell sorting (FACS), we compared the 


frequencies of NHEJ- and HDR-mediated knock-in after 
coupling with the CRISPR system (He et al., 2016). We found 
that knock-in via CRISPR/Cas9-induced NHEJ is superior to 
the commonly used HDR-based method in all human cell lines 
examined (He et al., 2016). This NHEJ-based knock-in 
approach has been applied in precise reporter knock-in in 
zebrafish (Auer et al., 2014; Hisano et al., 2015; Irion et al., 
2014; Kimura et al., 2014; Li et al., 2015a) and Xenopus (Shi et 
al., 2015), with such gene targeting previously impeded by the 
deficiency of the HDR pathway. More recently, CRISPR/Cas9- 
induced NHEJ has been shown to mediate high efficiency 
knock-in in mouse somatic tissues (Suzuki et al., 2016), but 
success in targeting zygotes or blastocysts to generate 
genetically modified mice has not yet been reported. 

Through CRISPR-coupled NHEJ repair, various genome 
targeting strategies have been established and utilized in 
generating genetically modified animal models. From studies 
published since early 2013, 75.9% (110/145) of in vivo genome 
editing studies have employed NHEJ-based targeting strategies. 
Extensive evidence has shown that NHEJ-based genome 
targeting is simpler, more flexible, and more efficient compared 
with HDR-based approaches. Without homology sequences 
involved, the design and system construction for NHEJ-based 
strategies are less laborious. On the other hand, however, the 
random nature of NHEJ repair incurs disadvantages including 
the unpredictability of indel-based mutagenesis as well as off- 
target cleavage and insertion. 


Genome editing by CRISPR-induced MMEJ repair 

Distinct from NHEJ and HDR, the two common forms of DNA 
repair, MMEJ requires microhomologous sequences of only 5— 
25 bp for the repair of DSBs in DNA. Sakuma et al. devised a 
detailed protocol for CRISPR-based gene knock-in using 
MMEJ, termed Precise Integration into Target Chromosomes 
(PITCh) (Sakuma et al., 2016). 

In this system, DSBs are needed in both the genomic DNA 
and donor vector to insert a DNA fragment from the donor into 
the genome. As MMEJ repair requires the presence of 
microhomology both upstream and downstream of the DSB 
site, two microhomologous sequences need to be added to the 
donor vector at both sides of the purpose sequence (Sakuma et 
al., 2016). For the CRISPR system, two sgRNAs are required to 
generate DNA cleavages near the microhomology sequences 
on both sides, while one sgRNA is used to induce DSBs on the 
genome DNA (Figure 2). Longer microhomologies of around 20 
bp are currently used to improve accuracy. After alignment 
between microhomologous sequences, the unmatched non- 
homologous sequences at the 3'-parts on both sides of the 
donor appear as single-strand tails and are removed. This 
results in the loss of a small part of the genome sequence at 
the target sites. Therefore, MMEJ-based genome editing is 
associated with deletion/insertions that are often larger than 
NHEJ-introduced indels (Villarreal et al., 2012). 

Targeted integration mediated by CRISPR-coupled MMEJ 
has been demonstrated in cultured cells and the generation of 
genetically modified zebrafish (He et al., 2015; Hisano et al., 
2015; Nakade et al., 2014). Moreover, one-step knock-in of 
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gene cassettes and floxed alleles has also been achieved in 
human cells and mouse zygotes through MMEJ (Aida et al., 
2016). Recently, precisely targeted gene integration in somatic 
tissues to correct mutation of the Fah gene and rescue liver 
failure in Fah-/- mice has also been demonstrated (Yao et al., 
2017). 


Comparison between different targeting strategies 

Conventional NHEJ repair does not require the presence of 
homology sequences and involves minimal processing of DNA 
broken ends. The activity of the NHEJ pathway is high and 
stable throughout the cell cycle. Distinctly, the HDR repair 
mechanism relies on long homology sequences (> 500 bp in 
general) to repair DNA lesions, and is only active from the late 
S phase to G2 phase during the cell cycle. The MMEJ pathway 
depends on microhomology sequences (5—25 bps) for DSB 
repair and is active during the M to early S phase (Taleei & 
Nikjoo, 2013). These differences explain why the activities of 
the different DNA repair pathways vary in different cell contexts. 

The intrinsic activities of the two major pathways, HDR and 
NHEJ, also vary in different species, despite high conservation 
of these pathways across a broad range of organisms. Lower 
vertebrates, such as zebrafish and Xenopus, are deficient in 
HDR-based DNA repair. Hence, modification of genome 
sequences in these models has mainly succeeded with NHEJ- 
based strategies, such as transgenesis, indel-based targeted 
mutagenesis/deletion, or the recent knock-in approach based 
on coupling TALEN- or CRISPR-induced DNA cleavage to the 
NHEJ repair mechanism (Auer et al., 2014; Hisano et al., 2015; 
Irion et al., 2014; Kimura et al., 2014; Li et al., 2015a; Shi et al., 
2015) (Table 1). In mammalian systems, although HDR was 
first employed to produce genetically modified mice, evidence 
shows that the NHEJ repair mechanism is predominant 
(Vasquez et al., 2001). Thus, the efficiency of NHEJ-based 
genome editing is generally superior to HDR-based approaches 
(He et al., 2016). 

Scientists have attempted to manipulate the balance between 
the HDR and NHEJ pathways. Through inhibiting DNA ligase IV, 
a key component of the NHEJ pathway, studies have shown 
that the efficiency of HDR-based gene targeting can be 
increased substantially (Chu et al., 2015). Similarly, silencing 
KU70, KU80, or DNA ligase IV largely suppressed NHEJ- 
mediated introduction of indels at the junction and enhanced 
HDR-mediated genome editing (Pierce et al., 2001). To date, 
this type of approach has not been applied for in vivo gene 
targeting. 

Besides efficiency, accuracy is another major concern. The 
HDR-based targeting strategy requires homology sequences as 
a template for DNA replication to repair induced DNA cleavage. 
It involves the cloning of homologous DNA and multi-step 
construction of donor plasmids. In return, the designed 
modifications can be introduced into the genome with high 
accuracy and off-target integrations can be largely reduced 
compared to other knock-in strategies. MMEJ-based targeting 
requires microhomologous sequences, which can be easily 
introduced into donor vectors through synthesized oligos, or 
during PCR amplification of the desired DNA for insertion. 
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Although the intrinsic MMEJ pathway often plays a minor role in 
overall DNA repair, the MMEJ-based targeting strategy has 
shown efficiency up to 10-fold higher than that of the HDR- 
based approach (Yao et al., 2017). Lastly, the NHEJ repair 
mechanism, which is completely independent of any homology 
sequences, offers the easiest path to modify an existing design 
for a new target site in the genome. In our recent study, a 
universal donor was established with the use of artificial sgRNA 
which did not target any sequence in mice and humans (He et 
al., 2016). With the minimum work involved in constructing the 
new sgRNA to the genome, the whole system was easily 
orientated for targeting a new locus (He et al., 2016). However, 
the random errors potentially present at the integration/ repair 
junctions with NHEJ-based targeting approaches should be 
considered during the design. 


CONCLUSIONS 


The recent advent of CRISPR technology has offered the 
simplest and possibly ultimate solution for introducing site- 
specific DSBs in genome DNA, which was once an 
insurmountable challenge in genome editing. Through coupling 
with different DNA repair mechanisms present in the 
endogenous cellular system, various targeting strategies have 
been developed to introduce a wide range of modifications in 
the genome through sequence-based editing. While further 
research is needed to evaluate the off-target issues and 
overcome the risks by developing improved CRISPR systems, 
the above technological advances have undoubtedly revolutionized 
biomedical research. The CRISPR-based genome editing 
approaches have significantly promoted studies on gene 
function via the rapid generation of animal models that carry 
genetic deficiencies of single or multiple genes. In addition, they 
have also enabled modeling of genetic diseases caused by 
chromosomal rearrangement or large deletions. Therefore, 
rapid progress could be foreseen in establishing various animal 
models for disease modeling or therapeutic intervention, which 
will significantly improve our understanding of human diseases 
and promote the development of new therapeutic strategies. 
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ABSTRACT 


Acute kidney injury (AKI) and chronic kidney disease 
(CKD) are worldwide public health problems 
affecting millions of people and have rapidly 
increased in prevalence in recent years. Due to the 
multiple causes of renal failure, many animal models 
have been developed to advance our understanding 
of human nephropathy. Among these experimental 
models, rodents have been extensively used to 
enable mechanistic understanding of kidney disease 
induction and progression, as well as to identify 
potential targets for therapy. In this review, we 
discuss AKI models induced by surgical operation 
and drugs or toxins, as well as a variety of CKD 
models (mainly genetically modified mouse models). 
Results from recent and ongoing clinical trials and 
conceptual advances derived from animal models 
are also explored. 


Keywords: Acute kidney injury; Chronic kidney 
disease; Mouse models; Transgenic mice 


INTRODUCTION 


Acute kidney injury (AKI) and chronic kidney disease (CKD) are 
linked to high morbidity and mortality. AKI is regarded as a rapid 
and reversible decline in renal function and is associated with 
accelerated CKD (Siew & Davenport, 2015). The ability to 
diagnose AKI has progressed significantly. Recent consensus 
diagnostic criteria include an increase in serum creatinine 20.3 
mg/dL (226.5 umol/L) within 48 h; an increase in serum 
creatinine to 21.5 times baseline; or urine volume «0.5 mL/kg/h 
for 6 h (Khwaja, 2012). Many risk factors such as drugs/toxins, 
sepsis, and ischemia-reperfusion (IR) commonly result in AKI 
and lead to reduced glomerular filtration rate (GFR) as well as 
acute tubular cell death (Sanz et al., 2013). CKD is a significant 
medical problem globally, with a rapid increase in incidence due 
to the rise in hypertension and diabetes (Tomino, 2014). CKD is 
usually diagnosed by the presence of albuminuria or estimated 
GFR from serum creatinine «60 mL/min/1.73 m? (Andrassy, 2013). 
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There is increasing recognition that AKI and CKD are closely 
linked and are therefore regarded as an integrated clinical 
syndrome (Chawla & Kimmel, 2012) (Figure 1). Key biological 
processes such as cell death, cell proliferation, inflammation, 
and fibrosis, as well as common biomarkers, are detected in 
both kinds of nephropathy (Andreucci et al., 2017; Endre et al., 
2011). Generally, tubular cell death, which includes necrosis, 
apoptosis, or necroptosis, is the main histological feature in 
early stage AKI, whereas fibrosis tends to occur under CKD. An 
increasing number of studies have shown that AKI is a major 
risk factor that can accelerate CKD progression (Pannu, 2013). 
Clinical observations have also found a strong relationship 
between AKI and CKD. Compared to patients with no history of 
AKI or CKD, AKI patients are more likely to develop new CKD 
or end-stage renal disease (ESRD) (Chawla et al., 2014; 
Chawla & Kimmel, 2012). Conversely, CKD also plays an 
important role in AKI. Patients with CKD may suffer higher risk 
of transient decreases in renal function consistent with AKI 
(Chawla et al., 2014). The underlying mechanism that results in 
acute renal dysfunction may involve decreased GFR, increased 
proteinuria, renal auto-regulation failure, and drug side effects 
(Hsu & Hsu, 2016). 

Mechanisms of disease generation and progression in AKI 
and CKD remain incompletely understood (Singh et al., 2013; 
Tampe et al., 2017). Although several clinical studies have 
investigated early stage predictive biomarkers of kidney disease, 
few has been applied in clinical practice (Endre et al., 2013; 
Francoz et al., 2016; Peng et al., 2008; Soto et al., 2010). Our 
group identified urinary fractalkine as a marker of acute kidney 
transplant rejection (Peng et al., 2008). However, considerable 
challenges still lay ahead for the design and implementation of 
clinical kidney disease trials. Large sample size and long follow- 
up duration are essential in a multicenter clinical trial to 





Received: 15 July 2017; Accepted: 05 September 2017 

Foundation items: This study was supported by the General Program 
of the National Natural Science Foundation of China (51309220, 
31470776) and QianJiang Talent Plan to W.Q. Lin 

"Corresponding author, E-mail: wlin@zju.edu.cn 
DOI:10.24272/j.issn.2095-8137.2017.055 


Zoological Research 39(2): 72-86, 2018 


Interstitial fibrosis 
Failed differentiation 
Sustained inflammation 


Risk factors Outcomes 





macs f ) => EE Kidney events 
Jenetic factors | ™ » Dp kidney à ki Chronic => Pepe 
Hypertension isease idney disease 





Diabetes mellitus ^ Decreased GFR 
Increased proteinuria 
Renal reserve loss ——— —— 
Autoregultation failure 
Side effect of drugs 
Figure 1 Relationship between acute kidney injury (AKI) and chronic kidney disease (CKD) 
AKI and CKD show some common risk factors such as age, race, and hypertension. Persistent AKI can induce interstitial fibrosis, failed cell 
differentiation, and finally CKD. Conversely, CKD also plays an important role in the development of AKI. Both are associated with an increased risk of 
death and can result in serious kidney events such as end-stage renal disease. 


A 
LPS model 
Sepsis induced 
CLP model 
Obstructive AKI: UUO 
AKI Ischemia-reperfusion 
model Cisplatin 
Folic acid 
Warfarin 
Drug/Toxin induced AKI Glycerol-induced 
Aristolochic acid nephropathy 
Gentamicin nephropathy 
B 
Renal mass reduction: 5/6 nephrectomy 
Hypertension-induced renal injury 
Diabetic nephropathy FSGS 
: Crescentic glomerulonephritis 
Primary glomerular nephropathy 
Membranous nephropathy 
CKD IgA nephropathy 
gA nephropathy 


Amyloid A (AA) amyloidosis 
Secondary nephrotic syndrome 


Systemic lupus erythematosus (SLE) 


Polycystic kidney disease 
Hereditary nephritis | (PKD) 
Alport syndrome 
Figure 2 Summary of major acute kidney injury (AKI) and chronic kidney disease (CKD) models 
A: Common causes of acute kidney injury (AKI). Traditional animal modeling approaches are included. B: Classification of chronic kidney disease 
(CKD). The following are available animal models for CKD; UUO: Unilateral ureteral obstruction; FSGS: Focal segmental glomerulosclerosis. 


guarantee the quality, efficiency, and safety of intervention and total mortality in kidney disease (Di Lullo et al., 2015; Palsson & 
treatments as there are many different types and causes of Patel, 2014; Ross & Banerjee, 2013), making it difficult to 
kidney disease and treatment can be protracted (Luyckx et al., determine the major cause and best treatment. Thus, mature 
2013). Moreover, serious complications greatly contribute to animal models are an indispensable part of scientifically 
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designed kidney disease studies, and play an important role in 
resolving the bottleneck issue in treatment. 

Animal models have been extensively used to clarify the 
pathogenesis and underlying mechanisms of renal disease. 
Among these models, mice and rats are the most commonly 
used to study nephropathy events and potential therapeutic 
targets and to identify specific biomarkers of disease. Mice and 
rats are easily bred and are relatively inexpensive to house and 
maintain (Wei & Dong, 2012). Classic acute kidney disease can 
be induced in a variety of murine models by surgery or 
administration of drugs or toxins (Ortiz et al., 2015; Singh et al., 
2012). Furthermore, genetically engineered mice and inbred 
strains provide a new platform for investigating complex human 
nephropathy (such as IgA nephropathy and diabetic nephropathy) 
(Marchant et al., 2015; Suzuki et al., 2014). In this review, we 


focused on murine models of AKI and CKD (Figure 2). 
ACUTE KIDNEY INJURY MODELS 


Recently, several reviews of available models, including their 
advantages and disadvantages, have been discussed (Ortiz et 
al., 2015; Ramesh & Ranganathan, 2014); however, the types 
of models are incomplete and many details, such as model 
techniques and modeling time, are not mentioned. Current 
models of AKI can be induced by IR (pre-renal acute kidney 
failure), injection of drugs, toxins, or endogenous toxins (sepsis- 
associated AKI), and ureteral obstruction (post-renal acute 
kidney failure) (Sanz et al., 2013; Singh et al., 2012) (Table 1). 
This section will discuss experimental AKI models, surgical 
operations, model time courses, and drug/toxin dose ranges. 


Table 1 Comparison of conventional acute kidney injury (AKI) mice models 





Models Species Time-course/Dose range Advantages Disadvantages References 
Sepsis Rats/mice ` Cecal ligation and punctured Simple; inexpensive; Variable response between models; Dejager et al., 2011; 
induced to induce AKI; single i.p. standardized dose of expected acute renal necrosis is not Liu et al., 2015; Liu 
dose of 10-15 mg/kg LPS LPS always achieved; AKI is not et al., 2016 
are commonly used to produced clinically and 
induced AKI pathologically 
Ischemia- Rats/mice Ischemia time: 30—45 min; High clinical relevance; Surgery requires; reproducible Hesketh et al., 2014; 
reperfusion reperfusion time: 24—48 h classical model with outcome dependent on accurate Wei & Dong, 2012 
high knowledge Ischemia/ Reperfusion time 
background 
UUO Rats/mice — 1—2 weeks; longer time for Technically simple; Surgery requires; not widely used Bander et al., 1985; 
renal fibrosis studies reproducible as AKI mode; renal function can De Chevalier et al., 
compensated by the non-ligated 2009; Ucero et al., 
kidney; 2014 
Cisplatin Rats/mice ` Single does 6-20 mg/kg; Simple and Requires higher does to induce AKI; Ko etal., 2014; Li et 
cisplatin within 72 h to reproducible; similar to the cisplatin use is decreased in al., 2005; Morsy & 
induce AKI human renal disease clinical Heeba, 2016; Xu et 
al., 2015 
Aristolochic ` Rats/mice 5 mg/kg/day Aristolochic acid Useful to study AKI- No clinical correlate; less Matsui et al., 2011; 
acid for 5 days CKD transition; nephrotoxicity report Wu et al., 2014 
Folic acid Rats/mice Single dose of 250 mg/kg Simple and Wen et al., 2012; 
induce AKI in 24-48 h reproducible Soofi et al., 2013 
Warfarin Rats 5/6 nephrectomy for 3 weeks  Clinically relevant; Only modeled in rats Brodsky, 2014; 
and 8 days on warfarin useful to study AKI Ozcan et al., 2012; 
caused by Ware et al., 2011 
anticoagulants 
Glycerol Rats/mice Deprived of water for 24 h Simple; reproducible Severe pathology Geng et al., 2014; 
and single injection of 8—10 Kim et al., 2014 
mg/kg 50% glycerol 
Gentamicin Rats Dose range 40-200 mg/kg Highly relevant; Requires higher dose of Boroushaki et al., 


for 4-10 days 


reversible AKI 


Gentamicin; different symptoms in 


human and rodents 


2014; He et al, 
2015; Heidarian et 


Sepsis-associated AKI 

Sepsis-associated AKI (SA-AKI) is characterized by severe 
inflammatory complications and high morbidity and mortality 
(Swaminathan et al, 2015). Frequently used experimental 
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al., 2017 


models of SA-AKI can be divided into two types: (1) injection of 
bacteria or endogenous toxins (e.g., LPS) into the peritoneum 
or blood; and (2) release of intestinal excreta by cecal ligation 
and puncture (CLP) or colon ascendens stent peritonitis (CASP) 
(Xu et al., 2014; Liu et al., 2015a). 


LPS models LPS-induced AKI has mainly been studied in 
rats and mice. Compared with other species, rodents are 
significantly more resistant to the toxic or lethal effects of LPS. 
The dose of LPS commonly used in research is 10-15 mg/kg 
(Fink, 2014; Liu et al., 2015b; Venkatachalam & Weinberg, 
2012). After LPS interacts with specific receptors such as Toll- 
like receptor 4 (TLR-4) (Solov'eva et al., 2013) on host immune 
cells, inflammatory cytokines like IL-1, TNF-a, and IL-6 are 
secreted, leading to hemodynamic alteration, widespread 
inflammation, and sepsis (Solov'eva et al., 2013). This is an 
acute model that usually terminates at 72-96 h. 

CLP model The CLP model is the most frequently used 
model due to its simplicity. Firstly, ligation of the cecum from the 
distal to the ileocecal valve is made. After that, two needle 
punctures are made to extrude stool into the abdominal cavity 
(Liu et al., 2016b; Poli-De-Figueiredo et al., 2008). CLP in mice 
can develop the typical symptoms of bacterial peritonitis 
observed in humans and yield good results (Fink, 2014). 
However, it is difficult to control the severity of sepsis and the 
differences in age and strain in CLP models (Zarjou & Agarwal, 
2011). Moreover, reproducible AKI cannot be developed in a 
CLP model (Dejager et al., 2011). 

Although experimental models have extended our 
understanding of sepsis and sepsis-associated AKI, there is still 
no effective clinical therapy (Alobaidi et al., 2015). Several 
Clinical trials targeting specific signaling pathways based on 
convincing results in murine models have failed to improve 
survival in septic patients. 


Ischemia-reperfusion (IR) model 

Currently, IR is the most widely used model for clinical AKI and 
renal transplant studies (Hesketh et al., 2014). Among the 
variety of existing models, the mouse clamping model is often 
applied due to its low costs and choice of transgenic models 
(Sanz et al., 2013). According to previous studies, commonly 
used models contain bilateral renal IR (Huang et al., 2012; Hu 
et al., 2010; Kim et al., 2012) and unilateral renal IR (Braun et 
al., 2012; Chen et al., 2011; Gall et al., 2011). 

First, 50-60 mg/kg of pentobarbital (b mg/mL) is used to 
anesthetize mice by intraperitoneal (i.p.) injection, with body 
temperatures then maintained at 36.5-37 °C during surgery. 
Second, the renal artery and vein are clamped by micro- 
aneurysm clips for a variable length of time to induce different 
severities of kidney injury. In general, clamping the renal 
pedicle for 30 min is used to induce IR injury (Huang et al., 
2015; Liu et al., 2016a). Successful ischemia can be confirmed 
by gradual darkening of the kidney (from red to dark purple). The 
clamp is then removed at the desired time to achieve reperfusion, 
with the kidney color immediately reverting to red (Hesketh et al., 
2014; Wei & Dong, 2012). Ischemia-reperfusion will trigger 
tubular cell necrosis and apoptosis, inflammation, and oxidative 
stress (Rovcanin et al., 2016; Sanz et al., 2013; Zhou et al., 
2015), which can result in a decline of renal function, as 
evaluated by blood urea nitrogen (BUN) and serum creatinine. 
Despite the view that the IR model is less stable, experimental 
factors such as anesthesia dose, mouse strain, age, gender, and 
feeding conditions can be well-controlled (Wei & Dong, 2012). 


Obstructive AKI 

Unilateral ureteric obstruction (UUO) is the most common 
rodent model used to study AKI and CKD (Ucero et al., 2014). 
This model can result in hydronephrosis and blood flow 
changes. Ischemia, hypoxia, and oxidative stress (Dendooven 
et al., 2011) contribute to the tubular cell death, followed by 
interstitial inflammation. Additionally, transformed fibroblasts 
can interact with extracellular matrix deposition to cause renal 
fibrosis (Xiao et al., 2016; Zhou et al., 2014). 

Recent studies using the UUO model have shown that 
adenosine levels (Tang et al., 2015), nuclear factor-erythroid-2- 
related factor 2 (Nrf2) (Chung et al., 2014), interleukin-10 (Jin et 
al., 2013), and the JAK/STAT signaling pathway (Koike et al., 
2014) are related to renal fibrosis, thus offering a potential 
therapeutic target for renal injury. The UUO model is relatively 
straightforward. Male animals, which are recommended in this 
model, undergo a midline abdominal incision under anesthesia, 
with the left ureter then ligated with 4—0 silk. After 24 h, the 
ureter obstruction is removed (Bander et al., 1985). Different 
from the complete UUO model, a partial UUO is created by 
inserting the ureter into a surgically created tunnel in the psoas 
muscle (Sugandhi et al., 2014). Reversible partial UUO is 
generally performed in neonatal mice to investigate kidney 
recovery after obstruction (Ucero et al., 2014). However, the 
complete UUO model is more popular because it is less 
technical and more easily reproduced. 


Toxin-induced AKI 

Exogenous drugs or poisons and endogenous toxins are used 
to stimulate AKI by their side or poisoning effects. Among these 
models, 6-20 mg/kg cisplatin can result in acute tubular injury 
within 72 h, whereas administration of 40-200 mg/kg 
gentamicin in rats for 4—10 d can induce acute renal failure. 
Aristolochic acid and high dose folic acid (FA) are frequently 
used to study AKI-CKD transition, with AKI models developed 
by warfarin and glycerol also used. 


Cisplatin-induced AKI 

Cisplatin is a chemotherapy agent that is widely used in the 
treatment of solid tumors (Karasawa & Steyger, 2015). 
However, high doses of cisplatin can induce prominent 
nephrotoxicity in humans (Humanes et al., 2012; Malik et al., 
2015). Among cisplatins adverse effects, direct proximal 
tubular toxicity is significant. Tubular cell necrosis and apoptosis 
are mediated by inflammation, oxidative stress, and calcium 
overload. These modes of cell death both lead to increased 
vascular resistance and decreased GFR (Ozkok & Edelstein, 
2014). The pathology and recovery phase of cisplatin-induced 
AKI models are comparable with those of humans. Many 
studies have reported that single i.p. injection of 6—20 mg/kg 
cisplatin can induce AKI within 72 h in rodent models (Ko et al., 
2014; Lee et al., 2009; Morsy & Heeba, 2016; Xu et al., 2015). 
Furthermore, other groups have developed AKI models by 
injecting higher doses of cisplatin, including 30 mg/kg, i.p. (Lu et 
al., 2008; Mitazaki et al., 2009) and 40 mg/kg, i.p. (Zhang et al., 
2016). Based on results from this experimental model, several 
therapeutic targets have been established. 
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Aristolochic acid nephropathy 

It has been reported that i.p. injection of aristolochic acid (AA) 
(5 mg/kg/d for 5 d) can induce AKI (Matsui et al., 2011; Wu et 
al., 2014). The pathology of acute aristolochic acid nephropathy 
(AAN) involves proximal tubular cell injury and necrosis with 
oxidative stress and progressive interstitial renal fibrosis 
(Baudoux et al., 2012; Nortier et al., 2015; Yang et al., 2010). 
Rabbit and rat models were first used to recapitulate human 
CKD and confirmed that aristolochic acid is related to Chinese 
herb nephropathy and Balkan endemic nephropathy (De Broe, 
2012; Sanz et al., 2013). Recently, studies on AA in AKI-CKD 
transition have increased. Signaling pathways such as nuclear 
factor erythroid 2-related factor 2 (Nrf2) (Wu et al., 2014) and 
Jun N-terminal kinases (JNK) signaling (Rui et al., 2012; Yang 
et al., 2010) have been shown to play important roles in AA- 
induced acute kidney lesions, thus providing several new 
therapeutic targets. 


Folic acid-induced AKI 

A high dose of FA can also induce AKI in mice (Wen et al., 
2012). Intraperitoneal injection of 250 mg/kg of FA (dissolved in 
0.3 mmol/L NaHCOs) can cause acute renal toxicity and injury 
in rodents (Soofi et al., 2013; Wen et al, 2012). The 
mechanism of FA nephropathy might be due to FA crystal 
deposition in the tubular lumen, which results in obstruction and 
extensive necrosis (Kumar et al., 2015; Szczypka et al., 2005). 
A more recent study showed that inhibition of ferroptosis can 
protect kidneys from FA-induced AKI, implicating its important 
role in FA nephropathy (Martin-Sanchez et al., 2017). 
Additionally, mitochondrial dysfunction and early renal fibrosis, 
which are related to CKD pathology, can be found in the FA- 
induced AKI model, thus providing a new way in which to 
investigate AKI-CKD transition (Stallons et al., 2014). 


Warfarin-induced AKI 

A new model of warfarin-induced hematuric AKI based on 5/6 
renal nephrectomized rats (Ware et al., 2011) was established 
to study the pathology of warfarin-related nephropathy (WRN) 
in patients with excessive anticoagulant (Rizk & Warnock, 
2011). The 5/6 nephrectomy was performed in Sprague Dawley 
rats, with animals allowed three weeks recovery from the 
surgery before warfarin treatment. Warfarin was given orally via 
drinking water, and warfarin dosage was based on rat weight 
(Brodsky, 2014). Extensive glomerular hemorrhage and tubular 
obstruction can occur in rats after seven days administration of 
warfarin (0.4 mg/kg/d), as well as increased serum creatinine 
(Brodsky, 2014; Ozcan et al., 2012). Besides, WRN can also 
induce AKI, accelerate CKD, and increase the mortality rate in 
warfarin-treated patients (Brodsky et al., 2011). However, the 
mechanism and therapeutic strategies to ameliorate WRN- 
induced AKI remain to be demonstrated. 


Glycerol-induced AKI 

Rhabdomyolysis is a syndrome in which the breakdown of 
skeletal muscle leads to the release of intracellular proteins and 
toxic compounds into circulation (Hamel et al., 2015). AKI is a 
common complication of rhabdomyolysis and accounts for the 
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high mortality (Elterman et al., 2015; Zhang et al., 2012). 
Presently, oxidative damage and inflammation are the two 
major causes of rhabdomyolysis-induced AKI (Tomino, 2014). 
To reproduce the typical symptoms observed in humans, rats or 
mice are deprived of water for 24 h, after which a 8-10 mL/kg 
dose of 50% glycerol is administrated in the hindlimb muscle 
(Geng et al., 2014; Kim et al., 2014b). Although studies have 
reported that vitamin C (Ustundag et al., 2008), L-carnitine 
(Aydogdu et al., 2006; Ustundag et al., 2009), and resveratrol 
(Aydogdu et al., 2006) can ameliorate rhabdomyolysis-induced 
AKI, there is currently no effective therapy for this disease 
except aggressive rehydration (Gu et al., 2014). 


Gentamicin nephropathy 

Gentamicin is an aminoglycoside antibiotic commonly used to 
prevent gram-negative bacterial infection. Nevertheless, 
nephrotoxicity limits its use in clinical practice (He et al., 2015). 
Doses of gentamicin ranging from 40-200 mg/kg administered 
for 4-10 d (Bledsoe et al., 2008; Boroushaki et al., 2014; 
Heidarian et al., 2017; Jabbari et al., 2011) can induce acute 
renal failure in rats. Administration of 100 mg/kg i.p. for 5 d is 
recommended to mimic gentamicin-induced nephrotoxicity (Hur 
et al., 2013; Stojiljkovic et al., 2008; Stojiljkovic et al., 2012). 
This acute model is characterized by increased levels of serum 
urea and creatinine, decreased GFR, tubular lesions, and 
fibrosis (Romero et al, 2009; Al-Shabanah et al. 2010; 
Balakumar et al., 2010). 


CHRONIC KIDNEY DISEASE MODELS 


CKD models mainly include diabetic/hypertensive nephropathy, 
glomerular injury, polycystic kidney disease (PKD), and chronic 
tubulointerstitial nephritis (Table 2). In this section, key 
information on various rodent models of CKD is discussed. 


Renal mass reduction 

The remnant kidney model has been one of most commonly 
used experimental models of CKD. The 5/6 subtotal 
nephrectomy approach is widely used to mimic human CKD in 
rats. The right kidney is removed and the upper and lower poles 
(2/8 of the left kidney) are resected after ligation of the left renal 
artery (He et al., 2012). After surgery, activation of the renin- 
angiotensin system (RAS) can cause glomerular 
hypertension/hyperfiltration (Ergür et al., 2015; Tapia et al., 
2012). Together with oxidative stress and inflammation, the 
glomerular hypertension/hyperfiltration ` finally results in 
glomerulosclerosis, tubulointerstitial injury, renal atrophy, 
proteinuria, and possible ESRD (Gong et al., 2016; Kim et al., 
2009). The remnant kidney model is highly influenced by the 
animal strain used. C57BL/6 mice are resistant to fibrosis or 
progressive CKD, whereas other animal strains such as rats 
and CD-1, 129/Sv, and Swiss-Webster mice are susceptible 
(Leelahavanichkul et al., 2010; Orlando et al., 2011). In 
addition, high mortality and little renal tissue after 5/6 
nephrectomy are also challenges to this model. 


Table 2 Advantages and disadvantages of experimental CKD mice models 


Pathology 


Models 


Advantages 


Disadvantages 


Reference 





Renal mass reduction 


Hypertension 


Diabetic nephropathy 


Primary glomerular 
nephropathy; focal 
segmental 
glomerulosclerosis 
Crescentic 
glomerulonephritis 


Membranous 
nephropathy 


IgA nephropathy 


Secondary nephrotic 
syndrome; Amyloid A 
(AA) amyloidosis 
Systemic lupus 
erythematosus 


Hereditary nephritis; 
polycystic kidney 
disease; Alport 
syndrome 


5/6 nephrectomy (rats) 


SHR rats+UNX; 
angiotensin II infusion 
models 

Streptozotocin mice/rats; 
NOD mice BB-DP rat; 
ob/ob mice db/db mice; 
DBA/2J mice; STZ-eNOS”; 
db/db-eNOS/ mice 
Adriamycin (rat, mice) 
models; Puromycin (rat) 
models 


Nephrotoxic nephritis 
model; anti-GBM nephritis 
model 

heymann nephritis rats; 
Cationic BSA mouse model 


ddY mouse, HIGA mice 
Uteroglobin-deficient mice 
CD89-transgenic mouse 


Injection of chemical or 
biological compounds 
models 

NZB/NZW F1 mice 
MRL and CD95 mutants 
model 

pkd1 or pkd2 gene 
engineered mouse; 
COL4A3 gene knockout 
mouse 


Mimics the progressive renal 
failure; after loss of renal mass in 
human 

Highly relevant to hypertension 
nephropathy; useful to study Angll 
effect over kidney 

Gene modified; commercially 
available; available on multiple 
strains 


Widely used; induce podocyte 
injury 


Similar to human Crescentic 
glomerulonephritis 
Widely used; identical pathology; 


marked proteinuria 


Reproduces human pathology; 
multiple models available 


Widely used; reproduce features 


of human diseases 


Widely used; marked proteinuria 


Widely used and useful to study 


PKD; major model; develop 
proteinuria and renal failure 


Highly influenced by back 
ground strains; surgery 
requires 

Surgery requires; high cost; 
slow progression 


No ideal model to mimics; 
diabetic nephropathy; 
expensive; some strains are 
infertile 


Highly depends on species 
and strains; toxic for most 
other cells 


Single symptom; difficult to 
induce 


Antigen (megalin) not found 
in human MN; limited 
experience 


Mild disease development 
usually without progression 
towards end-stage renal 
disease 

Rarely develop renal failure 


Incomplete features of SLE 


Ergur et al., 2015; He 
et al., 2012; Kim et al., 
2009 

Guo et al., 2015; 
Lankhorst et al., 2015; 
Zhong et al., 2016 

Betz & Conway, 2014; 
Graham & Schuurman, 
2015; Kitada et al., 
2016; Ostergaard et al., 
2017 

De Mik et al., 2013; 
Hakroush et al., 2014; 
Lee & Harris, 2011; 
Wada et al., 2016 
Borza & Hudson, 2002; 
Cheungpasitporn et al., 
2016 

Cybulsky, 2011; 
Jefferson et al., 2010; 
Motiram Kakalij et al., 
2016 

Eitner et al., 2010; 
Papista et al., 2015; 
Suzuki et al., 2014 


Kisilevsky & Young, 
1994; Teng et al., 2014 


Fagone et al., 2014; 
Nickerson et al., 2013; 
Otani et al., 2015 
Kashtan & Segal, 2011; 
Ko & Park, 2013; 
Korstanje et al., 2014; 
Ryu et al., 2012 


Diabetic nephropathy 

Diabetic nephropathy (DN) is the leading cause of ESRD. 
There are many kinds of rodent models relevant to diabetic 
nephropathy, but none of them perfectly mimics the human 
disease (Deb et al., 2010). The Animal Models of Diabetic 
Complications Consortium (AMDCC) defines the ideal rodent 
model of human diabetic nephropathy and complications (Kong 
et al., 2013; Kitada et al., 2016). The latest validated criteria 
are: (1) more than 5096 decrease in GFR; (2) greater than 10- 
fold increase in albuminuria compared with controls; and (3) 
pathological changes in kidneys including advanced mesangial 
matrix expansiontnodular sclerosis and mesangiolysis, 
glomerular basement membrane (GBM) thickening by >50% 
over baseline, arteriolar hyalinosis, and tubulointerstitial fibrosis. 


Classical type 1 diabetes can be modeled by the administration 
of streptozotocin (a toxin to B-cells that results in insulin 
deficiency), with spontaneous autoimmunity (e.g., NOD mice or 
BB-DP rat) or with gene mutation (Akita and OVE26 mice) 
(Graham & Schuurman, 2015; Kitada et al., 2016). A high fat 
diet is commonly used to induce obesity and insulin resistance 
and develop glomerular lesions in mice (Soler et al., 2012). 
Typical type 2 diabetes nephropathy (DN) model can be 
establised by leptin deficiency (e.g., ob/ob mice) or inactivation 
of the leptin receptor (e.g., db/db mice, Zuker rat) (Soler et al., 
2012). To exhibit more pathological features of human DN, 
recent studies have focused on (1) targeted gene knockout in 
mice (e.g., eNOS-deficient mice (Takahashi & Harris, 2014)), (2) 
selection of more susceptible rodent species and strains (eg. 
FVB (Chua et al., 2010) and DBA/2J mice (Østergaard et al., 
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2017), and (3) monogenic manipulations or superimposing 
additional key factors to accelerate nephropathy (e.g., STZ- 
eNOS“, db/db eNOS“) (Betz & Conway, 2014; Nakayama et 
al., 2009). 


Hypertension-induced renal injury 

Spontaneously hypertensive rats are usually used to investigate 
hypertension-induced nephropathy. Additionally, unilateral 
nephrectomy is required to promote significant renal injury with 
increased glomerular pressure and flow (Zhong et al., 2016). 
Chronic injection of angiotensin II for weeks also results in 
persistent hypertension and renal injury (Dikalov et al., 2014). 
Vascular endothelial growth factor (Lankhorst et al., 2015), 
Smad signaling (Liu & Davidson, 2012a), and inflammatory 
cytokines (Guo et al., 2015) are also involved in this process. 


Primary glomerular nephropathy 

Focal segmental glomerulosclerosis (FSGS) 

FSGS is a common primary glomerular disorder characterized 
by podocyte injury and loss and marked proteinuria (Fogo, 
2015). Although there is currently no primary FSGS model 
available, several secondary FSGS models have been 
established. Adriamycin (ADR) and puromycin are widely used 
to study FSGS. Single injection of these specific toxins can 
result in podocyte foot process effacement, deficient filtration 
barrier, and nephrotic syndrome (Fogo, 2003; Zhang et al., 
2013). However, the dosage of adriamycin is highly dependent 
on species and strain. Most rat species are susceptible to low 
doses of ADR ranging from 1.5-7.5 mg/kg (Lee & Harris, 2011), 
whereas most mouse strains are resistant to ADR. To produce 
a successful model, higher doses of ADR are required, for 
example 9.8—12 mg/kg in male BALB/C (Wada et al., 2016) and 
13-25 mg/kg in C57BL/c mice (Cao et al., 2010; Hakroush et 
al., 2014; Jeansson et al., 2009; Maimaitiyiming et al., 2016; 
Wang et al., 2000). 

Gene modification approaches in mice, such as inactivation 
of Mpv-17 (Casalena et al., 2014; Viscomi et al., 2009), 
knockout a-actinin-4 (De Mik et al., 2013; Henderson et al., 
2008) or NPHS2 (Mollet et al., 2009), or introducing the 
expression of Thy-1.1 antigen on podocytes, can also lead to 
proteinuria and FSGS (Smeets et al., 2004). 


Crescentic glomerulonephritis 

Antibodies fixation in the whole glomeruli (nephrotoxic nephritis) 
or GBM (anti-GBM nephritis) are the primary models used to 
mimic human crescentic glomerulonephritis (Hénique et al., 
2014). Intraperitoneal injection of heterologous antibodies to 
heterologous whole glomeruli can induce nephrotoxic nephritis 
(Gigante et al., 2011). Anti-GBM nephritis can be caused by 
immunization with the non-collagenous domains of the alpha-3 
chain of type IV collagen or passive transfer of anti-GBM 
antibodies (Cheungpasitporn et al., 2016; Kambham, 2012; 
Kvirkvelia et al., 2015). After treatment, severe proteinuria and 
azotemia appear in the following weeks. 


Membranous nephropathy 
Membranous nephropathy (MN) is a major cause of nephrotic 
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syndrome in the elderly and is characterized by subepithelial 
deposits and diffuse thickening of the GBM (Makker & Tramontano, 
2011). Active and passive Heymann nephritis model in rats 
closely resemble human MN and have been used to study MN 
(Sendeyo et al, 2013). 

Autologous antibodies are exposed to target antigens by 
injection of kidney extracts or antiserum to antigen generated in 
another animal species (Cybulsky, 2011; Jefferson et al., 2010), 
resulting in immune deposits associated with heavy proteinuria 
(Cybulsky et al., 2005). In rat models, megalin and receptor 
associated protein (RAP) are the major podocyte antigens 
targeted by the circulating antibodies (Ronco & Debiec, 2010). 
However, studies have shown that megalin is neither expressed 
in human podocytes nor detected in patients with membranous 
nephropathy (Beck & Salant, 2010; Ma et al., 2013). Recently, 
M-type phospholipase A2 receptor (PLA2R) was identified as a 
target antigen for autoantibodies in human MN (Debiec & 
Ronco, 2011; Herrmann et al, 2012; Kao et al., 2015). 
Additionally, circulating  thrombospondin  type-1 domain- 
containing 7A (THSD7A) has been detected in a subgroup of 
patients with idiopathic MN rather than PLA2R, suggesting a 
new target antigen in human MN (Tomas et al., 2014). 

The cationic BSA mouse model also produces features of 
human MN. Mice are preimmunized with cationic bovine serum 
albumin (CBSA) every other day for a week. Two weeks later, 
mice are reimmunized with cBSA in Freund's adjuvant (Motiram 
Kakalij et al., 2016). Mice will develop symptoms of MN, 
including severe proteinuria, diffuse thickening of the GBM, 
subepithelial deposits, and GBM spikes. 


IgA nephropathy (IgAN) 

IgAN is the most common form of glomerulonephritis, and is 
characterized by mesangial immune complex depositions that 
contain IgA1, IgG, complement C3, and IgM (Daha & Van 
Kooten, 2016). Inducible IgAN models include intravenous 
injection of IgA containing immune complexes to develop mild 
and transient IgAN (Rifai et al., 1979), and oral administration of 
protein antigens that result in mesangial IgA deposits 
(Emancipator et al., 1983). The ddY mouse is a spontaneous 
IgAN model derived from a non-inbred strain that develops 
glomerulonephritis and mild proteinuria without hematuria 
(Suzuki et al., 2014). Mouse line HIGA, an inbred strain with 
high levels of circulating IgA, shows significant early-onset 
immune deposits (Eitner et al., 2010). Other genetically 
modified mice, such as uteroglobin-deficient mice (Lee et al., 
2006) and CD89-transgenic mice (Moura et al., 2008; Papista 
et al., 2015), can also be used to investigate IgA nephropathy. 
Although some models are available, the underlying 
mechanism of IgAN is still not fully understood. 


Secondary nephrotic syndrome 

In this section, murine models of systemic lupus erythematosus 
and amyloidosis are reviewed. Transgenic murine models are 
widely used to investigate these complex diseases, especially 
systemic lupus erythematosus (SLE). Both MRL and CD95 
gene mutant animals can serve as research models to develop 
SLE symptoms and investigate potential therapies. 


Amyloid A (AA) amyloidosis 

Amyloid A (AA) amyloidosis is a serious complication of chronic 
inflammation. AA-type amyloid deposition can cause alteration 
in tissue structure and function, with the kidney noted to be a 
major target organ (Simons et al., 2013). Injection of chemical 
or biological compounds such as casein, lipopolysaccharide 
(Kisilevsky & Young, 1994; Skinner et al., 1977), an extract of 
amyloidotic tissue or purified amyloidogenic light chains (Teng 
et al., 2014) are widely used to create AA amyloidosis mouse 
models. However, unlike clinical AA-amyloid patients, these 
models rarely develop renal failure (Simons et al., 2013). In 
recent years, a striking transgenic murine model has been 
developed. Mice carrying the human interleukin-6 gene under 
the control of the  metallothionein-l promoter or with 
doxycycline-inducible transgenic expression of SAA provide 
another way to investigate AA-amyloid (Simons et al., 2013). 


Systemic lupus erythematosus (SLE) 

Lupus nephritis is characterized by autoantibodies against 
nuclear autoantigens such as DNA, histones, and nucleosomes 
(Liu & Davidson, 2012b). Most studies on SLE are based on 
murine models. Genetically modified models include MRL and 
CD95 mutants such as MRL" and FasL’ mice (Nickerson et 
al., 2013; Otani et al., 2015), BXSB mice (McGaha et al., 2005; 
McPhee et al., 2013), and NZB/NZW F1 mice (Fagone et al., 
2014), which are widely used to develop proteinuria, 
lymphoproliferation, and similar features relevant to human 
lupus nephritis (McGaha & Madaio, 2014). Recently, TWEAK- 
Fn14 signaling has been reported to play an important role in 
the progression of lupus nephritis and anti-TWEAK blocking 
antibodies can preserve renal function and increase survival 
rate in experimental models of CKD (Gomez et al., 2016; Sanz 
et al., 2014). Although multiple mouse models have been used 
to investigate lupus nephritis, each model has limitations that 
impede our understanding of the pathogenesis and clinical 
manifestations of this disease. Subsequently, no effective 
therapy for lupus nephritis currently exists. 


Hereditary nephritis 

Polycystic kidney disease (PKD) 

PKD includes a group of human monogenic disorders inherited 
in an autosomal dominant (ADPKD) or recessive (ARPKD) 
fashion. PKD is mainly restricted to the liver and kidney, and 
occurs in a range of ages from children to the elderly. In 
children and adults, ADPKD and ARPKD are the most common 
genetic nephropathies and leading causes of ESRD (Liebau & 
Serra, 2013). ADPKD is caused by mutation of either PKD1 
(85%) or PKD2 (15%) (Kim et al., 2014a), whereas ARPKD is 
caused by PKHD1 gene mutations (Sweeney & Avner, 2011). 
Although hereditary PKD is complex and diverse, it is normally 
induced by single mutations in single genes. Therefore, 
genetically engineered murine models are widely used to mimic 
human PKD. As homozygous mice of PKD1 or PKD2 result in 
embryonic lethality (Woudenberg-Vrenken et al., 2009), 
conditional knockouts, inducible strategies, or the introduction of 
unstable alleles are the major ways to establish experimental 
models (Ko & Park, 2013). There have been some successful 


clinical trials based on results from these models. For example, 
tolvaptan has been proven to be effective in ADPKD and is now 
marketed in Japan (Torres et al., 2012). Moreover, combination 
therapy of tolvaptan and pasireotide has brought significant 
reduction in cystic and fibrotic volume in a PKD1 mouse model 
(Hopp et al., 2015). 


Alport syndrome 

Alport syndrome (AS) is a hereditary glomerulopathy resulting 
from mutations in the type IV collagen genes COL4A3, 
COL4A4, or COL4A5, and is characterized by hematuria, renal 
failure, hearing loss, ocular lesions (Savige et al., 2011), and 
abnormal collagen IV composition in the GBM (Savige et al., 
2013). The COL4A3 gene knockout mouse is the major model 
used to study the pathogenesis of AS. Homozygous mice can 
develop proteinuria at 2-3 months of age and die from renal 
failure at 3-4 months (Kashtan & Segal, 2011). In COL42A3* 
mice, studies have shown that TNF-a contributes to Alport 
glomerulosclerosis by inducing podocyte apoptosis (Ryu et al., 
2012). Furthermore, spontaneous COL4A4 mutation in 
NONcNZO recombinant inbred mice exhibits early stage 
proteinuria associated with glomerulosclerosis. These 
genetically modified mice provide valuable models for potential 
therapy testing and help understand the mechanisms of AS 
(Korstanje et al., 2014). 


CONCLUSIONS 


Although AKI and CKD are significantly increasing worldwide 
and cause high mortality, clinical diagnosis and therapeutic 
interventions are lagging. AKI-CKD transition and the 
underlying mechanisms of complex CKD such as IgA 
nephropathy, diabetic nephropathy, and FSGS are still unclear 
and impede the search for potential therapies. Despite the 
valuable new insights into kidney disease gained from existing 
models, many do not fully reproduce human clinical diseases. 
Thus, improved murine models are still desperately needed to 
investigate potential diagnostic and therapeutic approaches. In 
AKI models, obtaining new mouse strains susceptible to 
toxins/drugs is urgent, and finding new approaches to develop 
stable and reproducible AKI models is necessary. As for CKD 
models, to develop complex and specific pathologies, mice with 
multiple genetically modified will be widely used to develop 
complex and specific pathology in the near future. Additionally, 
models that faithfully develop common conditions such as DN 
or SLE are also imperative. 
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ABSTRACT 

Cationic antimicrobial peptides (AMPs) are considered 
as important candidate therapeutic agents, which exert 
potent microbicidal properties against bacteria, fungi 
and some viruses. Based on our previous findings 
king cobra cathelicidin (OH-CATH) is a 34-amino 
acid peptide that exerts strong antibacterial and 
weak hemolytic activity. The aim of this research 
is to evaluate the efficacy of both OH-CATH30 and 
its analog D-OH-CATH30 against clinical isolates 


comparing with routinely utilized antibiotics in vitro. 


In this study, 584 clinical isolates were tested 
(spanning 2013-2016) and the efficacy of the 
candidate peptides and antibiotics were determined 
by a broth microdilution method according to 
the CLSI guidelines. Among the 584 clinical 
isolates, 85% were susceptible to OH-CATH30 
and its analogs. Both L- and D-OH-CATH30 
showed higher efficacy against (toward) Gram-positive 
bacteria and stronger antibacterial activity against 
nearly all Gram-negative bacteria tested compare 
with antibiotics. The highest bactericidal activity 
was detected against Acinetobacter spp., including 
multi-drug-resistant Acinetobacter baumannii (MRAB) 
and methicillin-resistant Staphylococcus aureus 
(MRSA). The overall efficacy of OH-CATH30 and its 
analogs was higher than that of the 9 routinely used 
antibiotics. OH-CATH30 is a promising candidate 
drug for the treatment of a wide variety of bacterial 
infections which are resistant to many routinely used 
antimicrobial agents. 


Science Press 


Keywords: Cathelicidin; Antibacterial agent; Clinical 
isolates; OH-CATH30 


INTRODUCTION 


The abundant use of traditional antibiotics has resulted in 
the emergence of many antibiotic-resistant isolates worldwide, 
causing threatening to human health and the rate of antibiotic 
discovery and production has been insufficient to respond to 
this crisis (Fischbach & Walsh, 2009; Nathan, 2004). Recent 
studies have shown that there are many components in the 
venom that possess antimicrobial activity. Snake venom, 
produced by specialized glands in the snake jawbone, is 
a particularly rich source of such antimicrobial compounds 
(Tashima et al., 2012). Several venom compounds have been 
used for antimicrobial effects, such as phospholipases A2, 
metalloproteinases, L-amino acid oxidases and antimicrobial 
peptides (De Oliveira Junior et al., 2013). Cathelicidins 
have been identified as the main family of naturally-occurring 
antimicrobial peptides from snake venom, which exhibit potential 
microbicidal properties against bacteria, fungi and some viruses 
(Zhang, 2015). Currently, at least 9 cathelicidin-type antimicrobial 
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peptides have been identified from elapid snake venoms, most 
of which exhibit potential antimicrobial activities (Falcao et 
al., 2014; Zhao et al., 2008). Cationic antimicrobial peptides 
(AMPs) are considered as important candidate therapeutic 
agents comprising a diverse group of bactericidal molecules 
for which microbial organisms show lower levels of resistance, 
for example pexiganan, an AMP developed for the treatment 
of diabetic foot infection (Ge et al., 1999). Although several 
peptide-antibiotics are well established clinically, such as 
polymixin B, AMPs alone without any modifications have not 


been widely used in clinical treatment (Hancock & Sahl, 2006). 


As an unique family of AMPs, the cathelicidin peptides shared 
conserved N-terminal domains from a variety of species and 
exhibited effective antimicrobial activity and some cytotoxic 
activity towards eukaryotic cells (Johansson et al., 1998; 
Zanetti et al., 1995). 


in vitro 


OH-CATH 30 (L-/ D-) 


We previously reported a novel cathelicidin-derived peptide 
from the king cobra. This reptile cathelicidin was 
termed OH-CATH30 and exhibited potential broad-spectrum 
antibacterial activity in vitro and relatively low toxicity in vivo 
(Li et al., 2012; Zhang et al., 2010). The aim of this study 
is to evaluate the efficacy of OH-CATH30 and its analog 
D-OH-CATH30 (comprising all D-amino acids residues) against 
Clinical isolates (collected from patients in hospital) compared 
with routinely utilized antibiotics in vitro. Meanwhile, the 
corresponding information is crucial for the cytotoxicity to 
eukaryotes and the immunogenicity of these peptides, which 
hemolysis assay and IL-6 assay were further investigated. 
Considering that most of AMPs have free radical scavenging 
ability, we also expanded nitric oxide assay and detected the 
release of nitric oxide (Figure 1A). 


in vivo 
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Figure 1 Cytotoxicity of OH-CATH30 and its analog 


A: The workflow of cytotoxicity determination to L-OH-CATH30 and D-OH-CATH30; B: Distribution of LOH-CATH30 and D-OH-CATH30 MICs for 584 bacterial isolates. 
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MATERIALS AND METHODS 


Ethics statement 

This study was approved by the Ethics Committee of Puer 
University and the Biomedical Ethics Committee of the First 
Affiliated Hospital of Kunming Medical University. The data were 
analyzed anonymously, patient identity deduced or disclosed. 


Clinical isolates, reference strains and materials 

A total of 584 clinical isolates were collected between July 
2013 and June 2016, which included 14 different species as 
follows: (i) Acinetobacter spp. (n=53); (ii) Citrobacter spp. 
n=7); (iii) Enterobacter spp. (n=41); (iv) Escherichia spp. 
n=229); (v) Klebsiella spp. (n=126); (vi) Proteus spp. (n=4); 
vii) Pseudomonas spp. (n=76); (viii) Salmonella spp. (n=16); 
ix) Serratia spp. (n=12); (x) Sphingobacterium spiritivorum 
n=1); (xi) Staphylococcus aureus (MSSA) (n=2) and S. aureus 
MRSA) (n=2); (xii) Stenotrophomonas maltophilia (n=9); (xiii) 
Streptococcus pneumoniae. (n=2); and (xiv) Yersinia spp. 
(n=4). The species identification of each clinical isolates 
was confirmed with the Vitek 2 system (bioMérieux, France; 
part of the data shown in Supplementary Figure S1). The 
antibiotic susceptibility of the clinical isolates was determined 
using the Kirby-Bauer disk diffusion method, in accordance 
with the Clinical and Laboratory Standards Institute (CLSI) 
2009 guidelines (Clinical and Laboratory Standards Institute, 
2009). Twelve control strains were also included: E. coli 
ATCC 25922 and ATCC 35218, Haemophilus influenza ATCC 
49766 and ATCC 49247, S. aureus ATCC 25923 and ATCC 
43300, Klebsiella pneumoniae ATCC 13883 and ATCC 700603, 
S. maltophilia ATCC 8090, Enterococcus faecalis ATCC 
29212, Enterobacter cloacae ATCC 13047 and Pseudomonas 
aeruginosa ATCC 27853. All strains were cultured in MHB 
Mueller-Hinton broth (MHB, pH 7.2) medium at 37 °C. All other 
reagents were of analytical grade and were obtained from 
commercial sources. 


Ka gion Cpe Sma pens, pat 


Antibiotics 

Ampicillin (Amp) and cefoperazone (CPZ) were acquired from 
the General Pharmaceutical Factory of Harbin Pharmaceutical 
Group (Harbin, China). Roxithromycin (ROX), chloramphenicol 
(CHL) and azithromycin (AZM) were from Sangon Biotech Co., 
Ltd. (Shanghai, China). Amikacin (AMK), vancomycin (VAN), 
levofloxacin (LVX) and polymyxin (PB) were from Sigma-Aldrich 
Co. LLC (Sigma, St. Louis, MO, USA). 


Antimicrobial peptides synthesis 

Pexiganan (GIGKFLKKAKKFGKAFVKILKK-NH2), OH-CATH30 
(KFFKKLKNSVKKRAKKFFKKPRVIGVSIPF) and its analog 
D-OH-CATH30 (KFFKKLKNSVKKRAKKFFKKPRVIGVSIPF, 
italics indicate D-amino acids) were synthesized by solid-phase 
synthesis on an Applied Biosystems Model 433A peptide 
synthesizer according to the manufacturer’s standard protocols, 
as reported previously (Li et al., 2012). Then, the crude synthetic 
peptide was purified by reverse-phase high-performance lipid 
chromatography (RP-HPLC, Supplementary Figure S2). Mobile 
phase eluent A consisted of 0.1% TFA (aqueous) and mobile 
phase eluent B consisted of ACN/ddH2O/TFA 90/10/0.08% 


(v/v/v). The sample was added along with auto-sampler to 
a Thermo Scientific EASY loading column (Thermo Fisher 
Scientific, USA) (2 cmx100 um, 5 um-C18) then to an analytic 
column (75 umx100 mm, 3 um-C18) with a flow rate of 1.2 
mL/min. The presence of peptides was confirmed by an 
absorption peak at 215 nm. The purity of each synthetic 
peptide was above 9596. The identity and purity of each peptide 
was further confirmed by matrix-assisted laser desorption 
ionization-time of flight (MALDI-TOF) mass spectrometry 
(Voyager; Supplementary Figure S2). The survey of the full 
scan MS spectra (m/z 300-1 800) was acquired in the Orbit 
rap with a 70 000-resolution (m/z 200). Dynamic exclusion was 
set to 15s. The 10 most intense multiply charged ions (272) 
were sequentially isolated and fragmented by higher-energy 
collisional dissociation (HCD) with a fixed injection time of 80 
ms and 17 500-resolution (m/z 200). The conditions were as 
follows: spray voltage, 2 kV; no sheath and auxiliary gas flow; 
heated capillary temperature, 250 °C; and normalized HCD 
collision energy 27 eV; underfill ratio 0.1%. The MS/MS ion 
selection threshold was set at 1x10? counts. 


Determination of MICs 

MICs were determined by the broth microdilution method 
(Wiegand et al. 2008) according to the CLSI guidelines 
(Clinical and Laboratory Standards Institute, 2009). In each 
well of a 96-well plate, 100 uL of MHB broth and bacteria 
(5x105 colony-forming unit (cfu)/mL) were incubated with 
various concentrations of antimicrobials for 16 h at 37 °C. The 
concentrations of the antimicrobials were the same for each of 
the tested bacterial strains in three independent experiments, 
and the MIC values were obtained without inter-experiment 
variations and expressed as pg/mL. For compounds and 
antibiotics drugs, the interpretation criteria were set by using 
the CLSI published criteria. The concentration of antibiotics 
was initially set at 512 g/mL, then diluted to 4 g/mL. As for 
the (only two are listed) peptides used in this study (pexiganna, 
polymyxin B), the concentration was diluted from 256 pg/mL to 
2 ug/mL. In each batch of susceptibility tests, the 12 control 
strains listed above were included as a quality control. 


Hemolytic assay and cytokine determination 

Hemolytic assays were investigated using human red blood 
cells in liquid medium as previously reported (Yang et al., 
2012). Serial dilutions of the samples were incubated with 
washed red blood cells (2%) at 37 °C for 30 min, the cells were 
then centrifuged, and the absorbance of the supernatant was 
measured at 540 nm. Maximum hemolysis was determined by 
adding 196 Triton X-100 to the cell samples (Liu et al., 2012). 

An ELISA kit (Neobiosoence) was employed for IL-6 
determination according to the manufacturer's instructions. 
Nitric oxide was quantified using the Griess chemical method 
(Park et al., 1993). 

IL-6 detection in vitro: Prior to euthanization, mice were 
administered 3 mL of saline by intraperitoneal injection. The 
abdomen was kneaded for 2 min before being opened up and 
the peritoneal fluid was collected. After excision, the mice livers 
and kidneys were placed into physiological saline, cut(sliced) 
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into pieces, homogenized with trypsin, ground into fluid, passed 


through a mesh sieve and centrifuged at 2 000 r/min for 10 min. 


The supernatant was discarded and the pellet was washed 
with PBS until the supernatant was clear. Then the cells 
were re-suspended in F12 medium and 190 uL of cells (about 
1x10’) were added to each well of a 96-well plate before the 
adding 10 pL of different concentrations of OH-CATH30 as the 
experimental group or F12 medium as the control. The plate 
was then incubated at 36 °C for 2 h, followed by centrifugation 
at 3 000 r/min for 20 min. The supernatant was retained for 
further determinations. 

IL-6 detection in vivo: Mice were intraperitoneally injected 
with 2 mL of saline, then after 5 h were intraperitoneally injected 
with 5 mg/kg or 10 mg/kg of different types of OH-CATH30 
(Li et al., 2013). Mice were euthanized, then the abdomen 
was kneaded for 2 min, after which it was opened and the 
peritoneal fluid was collected. After excision, the livers and 
kidneys were placed into physiological saline, sliced into pieces, 
homogenized with trypsin, ground into fluid, passed through a 
mesh sieve and centrifuged at 3 000 r/min for 20 min. The 
supernatant was discarded and the pellet was washed with 
PBS until the supernatant was clear. 


Statistical analysis 

The data was analyzed using two-way ANOVA. The 
experimental data are expressed as MlCso and Milon values. 
The level of statistical significance was set at P«0.05. 


RESULTS 


Susceptibility test 

The in vitro susceptibility of 584 clinical isolates to three AMPs 
and nine antibiotics was tested, which was also represented 
by its cytotoxicity of prokaryotes cell (Figure 1A). Table 1 
shows the MICs, with ranges of MICsọ and MICgo values, of 
each antimicrobial agent tested against 14 different species. 
Overall, the efficacy of OH-CATH30 and D-OH-CATH30 was 
higher when compared with 9 routine antibiotics, and slightly 
higher than pexiganan. Among the 584 clinical isolates 
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Figure 2 Hemolytic activity of the L- and D- OH-CATH30 peptides 


tested, 85% were susceptible to OH-CATH30 and its analog 
with a MIC<64 mg/L (Figure 1B). For Escherichia spp. and 
Klebsiella spp. (61% of all clinical isolates), OH-CATH30 
was more active than the 9 antibiotics tested, and showed 
slightly higher activity than pexiganan. For 53 strains of 
Acinetobacter spp., including multidrug-resistant Acinetobacter 
baumannii (MRAB), the MIC value of D-OH-CATH30 was 
doubled comparing with that of OH-CATH30, and polymixin 
B also had high rates of susceptibility (77%) to Acinetobacter 
spp. compared with other antibiotics, except for pexiganan. 
For the 76 strains of Pseudomonas spp., the activity of 
OH-CATH30 and its analog was higher than pexiganan, 
although these peptides are all showed higher activity than 
the 9 antibiotics, with the exception of levofloxacin. For the 
16 strains of Salmonella spp., all showed high susceptibility to 
the antimicrobial peptides and antibiotics, with the exception of 
ampicillin and cefoperazone. The rate of susceptibility (92%) to 
levofloxacin among these species was considerably higher than 
the other 8 antibiotics. Among the 14 species of clinical isolates 
tested, Serratia spp., for which there were 12 strains, had the 
lowest rate of susceptibility to antimicrobial peptides and the 
peptide-antibiotic (polymixin B). OH-CATH30 inhibited all of the 
species, comprising around 10 strains, at a concentration of « 
64 mg/L, with the exception of Proteus spp. and S. maltophilia. 
Of the four strains of S. aureus tested, including two MSSA 
and two MRSA, D-OH-CATH30 was found to be more effective 
at inhibiting MRSA than OH-CATH30, with a range of < 8 mg/L. 
Overall, L- OH-CATH30 showed higher efficacy against two 
thirds of the tested clinical isolates than OH-CATHSO. 


Hemolysis assay 

Generally, the peptide and its D- analog showed little difference 
in hemolytic activity to human red blood cells (Figure 1A). 
When the dose of L-OH-CATH30 or D-OH-CATH30 was lower 
than 125 g/mL, similar hemolysis rates of about 10% were 
observed, but as the dose increased to 250 pg/mL, high 
hemolytic activities exceeding 70% for the peptide and 80% for 
its D- analog were detected (Figure 2). 
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Peptides at various concentrations were incubated with human red blood cells for 4 h at 37 °C. 
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Table 1 MICs of three antimicrobial peptides and nine antibiotics determined for 584 clinic isolates, and the MIC5 and Milon values 











Percentage Percentage 
" of isolates d of isolates 

Isólatós No. OH-CATH30 MIC(mg/L) D-OH-CATH30 MIC (mg/L) Amp? MIC (mg/L) susceptible to* CPZ? MIC (mg/L) susceptible to° 

(%) (%) 

MICs? ` MIC go Range MICs? MICso? Range MICs?” MICg® Range MICso?: MICso*: 7 Range 

Acinetobacter sp. 53 16 32 8128 8 16 2->128 >256 >256 8-2256 0 2256 2256 4—>256 2 
Citrobacter sp. 7 8 16 4->128 8 >128 4->128 >256 >256 8—>256 20 >256 >256 4->256 40 
Enterobacter sp. 41 8 32 4->128 8 >128 4->128 >256 >256 64—>256 0 >256 >256 4->256 33 
Escherichia sp. 229 8 64 2->128 8 >128 2->128 >256 >256 8-2256 5 2256 2256 4->256 25 
Klebsiella sp. 126 8 64 4->128 16 128 2->128 >256 >256 16—>256 0 >256 >256 4->256 29 
Proteus sp. 4 >128 >128 8->128 8 128 8->128 | 2256 2256 16256 0 256 256 4-256 33 
Pseudomonas sp. 76 8 128 4->128 16 >128 4->128 >256 >256 16-2256 5 256 2256 4->256 23 
Salmonella sp. 16 8 8 4->128 4 >128 4->128 64 >256 4->256 54 32 >256 4->256 54 
Serratia sp. 12 >128 >128 4->128 8 64 8->128 >256 >256 128-2256 0 256 2256 4->256 36 
Sphingobacterium spiritivorum 1 8 8 8 16 16 16 >256 >256 >256 0 >256 >256 >256 0 
Staphylococcus aureus(MSSA) 2 16 16 16 32 64 32-64 >256 >256 >256 0 >256 >256 >256 0 
Staphylococcus aureus(MRSA) 2 16 32 16-32 8 8 8 >256 >256 >256 0 >256 >256 >256 
Stenotrophomonas maltophilia 9 8 >128 4->128 8 32 4-32 >256 >256 128-2256 0 256 2256 4->256 13 
Streptococcus pneumoniae 2 8 8 8 8 >128 8—2128 2256 2256 2256 0 2256 2256 2256 0 
Yersinia sp. 4 8 2128 8128 8 2128 8-252128 2256 2256 2256 0 2256 2256 2256 0 
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Continued 











Percentage Percentage Percentage Percentage 

ROX? MIC (mg/L) of isolates AZM? MIC (mg/L) of isolates AMK? MIC (mg/L) of jeolales PB^ MIC (mg/L) of ISBlatesn 

Isolates No. susceptible susceptible susceptible susceptible 
to? (96) to? (96) to? (96) to? (96) 

MICso? " Mie `" Range Milon? ` Milon" Range Milon? ` Milon `" Range Mie?  MICgo? Range 
Acinetobacter sp. 53 2256 2256 8—256 N/A 2256 2256 4—>256 N/A >256 >256 4->256 28 <2 32 2->128 77 
Citrobacter sp. 7 >256 >256 32->256 N/A 16 >256 4—>256 N/A 8 >256 4->256 60 <2 >128 2->128 100 
Enterobacter sp. 41 2256 2256 128—.256 N/A 8 2256 4—>256 N/A <4 >256 4->256 67 <2 128 2->128 69 
Escherichia sp. 229 >256 >256 16—256 N/A 8 128 4—>256 N/A <4 >256 4->256 66 <2 128 2->128 78 
Klebsiella sp. 126 256 >256 32-2256 N/A 8 2256 4—>256 N/A <4 >256 4->256 58 <2 >128 2->128 74 
Proteus sp. 4 >256 >256 >256 N/A 32 128 32-128 N/A <4 4 4 100 <2 >128 2->128 67 
Pseudomonas sp. 76 128 >256 4->256 N/A 64 >256 4—>256 N/A 8 >256 4->256 54 <2 >128 2->128 67 
Salmonella sp. 16 128 256 128—256 N/A «4 64 4-64 N/A <4 32 4->256 69 <2 64 2-64 77 
Serratia sp. 12 2256 2256 32-2256 N/A 16 2256 4—>256 N/A <4 64 4->256 73 >128 >128 2->128 
Sphingobacterium spiritivorum 1 >256 >256 >256 N/A 16 16 16 N/A <4 <4 <4 100 <2 <2 <2 100 
Staphylococcus aureus(MSSA} 2 256 >256 256-2256 0 «4 16 «4-16 50 «4 «4 «4 100 4 2128 4->128 0 
Staphylococcus aureus(MRSA)® 2 >256 >256 >256 0 8 >256 8—2256 0 «4 «4 «4 100 8 8 8 0 
Stenotrophomonas maltophilia 9 128 2256 8-2256 N/A 64 2256 4—2256 N/A 32 2256 4—»2256 50 «2 64 2-64 88 
Streptococcus pneumoniae 2 2256 2256 2256 0 8 2256 8—>256 0 <4 >256 «4—2256 100 «2 2128 <2—>128 50 
Yersinia sp. 4 32 >256 32-2256 N/A 16 256 16-256 N/A «4 64 4—64 100 «2 2128 2->128 100 
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Continued 











Percentage Percentage Percentage Percentage 
CHL MIC (mg/L) of Noon PEX® MIC (mg/L) of isolates LVX* MIC (mg/L) of isolates VAN? MIC(mg/L) of ieren 
Isolates No. susceptible susceptible susceptible susceptible 
to? (96) to^ (%) to? (%) to? (96) 
Milon? MIC go” Range MICso? ` Mie? Range MICso®"  MICag?" Range MICs? ` Milon" Range 
Acinetobacter sp. 53 128 256 8->256 5 8 16 4-128 56 128 >256 16->256 N/A 128 256 8->256 5 
Citrobacter sp. d 8 32 4-32 80 «4 32 4-32 60 256 2256 4->256 N/A 8 32 4-32 80 
Enterobacter sp. 41 16 2256 4->256 53 <4 32 4-128 72 256 >256 64—256 N/A 16 2256 4->256 53 
Escherichia sp. 229 <4 128 4->256 58 <4 128 4-256 55 128 >256 8—256 N/A «4 128 4->256 58 
Klebsiella sp. 126 16 256 4->256 46 >256 32 4-128 48 8 >256 8->256 N/A 16 256 4->256 46 
Proteus sp. 4 8 256 8-256 67 8 16 4—128 33 128 256 128—256 N/A 8 256 8-256 67 
Pseudomonas sp. 76 «4 2256 4->256 68 <4 64 4-128 63 >256 >256 64—256 N/A «4 2256 4—>256 68 
Salmonella sp. 16 <4 256 4->256 84 <4 4 4-128 92 >256 >256 4->256 N/A <4 256 4—>256 84 
Serratia sp. 12 64 >256 4->256 0 <4 16 4-32 55 >256 >256 64—256 N/A 64 2256 4->256 0 
Sphingobacterium spiritivorum 1 «4 «4 «4 100 «4 «4 <4 100 256 256 256 N/A <4 <4 <4 100 
Staphylococcus aureus(MSSA)d 2 <4 32 «4-32 50 8 8 8 0 «4 8 «4-8 50 «4 32 «4-32 50 
Staphylococcus aureus(MRSA)® 2 «4 «4 «4 100 16 32 16-32 0 «4 2256 <4—>256 50 <4 <4 <4 100 
Stenotrophomonas maltophilia 9 32 >256 4->256 50 <4 256 4-256 63 128 >256 128-2256 N/A 32 2256 4—2256 50 
Streptococcus pneumoniae 2 «4 «4 «4 100 16 32 16-32 0 «4 «4 «4 50 «4 «4 «4 100 
Yersinia sp. 4 «4 «4 «4 100 «4 4 4 100 256 2256 4->256 N/A <4 <4 <4 100 





a: MICsq and MICgg, MICs at which 50%, or 90% of isolates are inhibited, respectively; P: Amp, Ampicillin; CPZ, Cefoperazone; ROX, Roxithromycin; CHL, Chloramphenicol; AZM, Azithromycin; 
AMK, Amikacin; VAN, Vancomycin; LVX, Levofloxacin; PB, Polymyxin B; PEX, Pexiganan; °: Interpretive criteria based on CLSI recommendations (2009); d. MSSA, Methicillin-sensitive 
Staphylococcus aureus; *: MRSA, Methicillin-resistant Staphylococcus aureus. N/A: Not applicable. *: P<0.05, OH-CATH30 vs. the treated drugs or compounds. 
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Nitric oxide release 
The peptide and its D- analog displayed steady nitric oxide release 
when the concentration ranged from 500 pg/mL to 6.25 ug/mL. 
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The amount of nitric oxide released tripled than that in the vitamin 
C as control group, suggesting that a low concentration of OH- 
CATHSO stimulated relatively high nitric oxide release (Figure 3). 
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Figure 3 Nitric oxide release induced by L- and D- OH-CATH30 peptides in vitro 


IL-6 immunogenicity response 

L-OH-CATH30 is more effective in increasing the IL-6 
concentration during an immunogenicity response both in 
vitro and in vivo compared with the peptide and its D- 
analog. Interestingly, OH-CATH30 showed the lowest IL-6 
immunogenicity response in the kidney in vitro, whereas 
L-OH-CATH30 showed clear induction of IL-6 in the kidney in 
vivo. By comparison, the peptide and its D- analog promoted 
IL-6 potently in peritoneal fluid in vitro but had virtually no effect 
on the IL-6 immunogenicity response in vivo (Figure 4, 5). 


DISCUSSION 


Drug resistance in bacteria, in particular the rise of multi-drug 
resistance, poses a serious threat to human health that 
urgently requires new drugs with broad efficiency against 
clinically-encountered bacteria (Leid et al., 2012). AMPs, 
as novel therapeutic drugs, are being used increasingly to 
treat infections. In our previous study, OH-CATH30 was 
found to possess relative normal bactericidal activity and low 
toxicity in vivo and in vitro (Li et al., 2012), indicating that 
it is a competitive candidate for the development of novel 
antimicrobial agent. 

Here, we tested OH-CATH30 and its D- analog against 584 
Clinical isolates, including several of drug-resistant pathogens. 
The data showed that this peptide and its D- analog are broadly 
active against important medical pathogens including MRSA, 
MRAB, S. pneumoniae and Pseudomonas spp. Importantly, 
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this peptide showed high efficacy against Gram- positive 
bacteria equal to amikacin, and higher efficacy against 
almost all Gram-negative bacteria tested than that of the 
peptide-antibiotic polymixin B (shown in Table 1). This peptide 
also showed good antimicrobial activity against Acinetobacter 
spp., which is primarily associated with nosocomial infections 
in severely- ill patients in which the management of infections 
is difficult due to increasing resistance to multiple classes 
of antibacterial agents (Karageorgopoulos et al., 2008). Our 
results indicated that OH-CATH30 and its D- analog have 
higher efficacy against these strains compared with routine 
antibiotics. 

Usually, searching for new antibiotics, we particularly 
focus on the cytotoxicity and immunogenicity. Meanwhile, 
L-OH-CATH30 and D-OH-CATH30 showed low cytotoxicity to 
eukaryotic cells (red blood cell in our experiments), compared 
to bacterial cells (their high bactericide activity). They had 
little difference in hemolytic activity and free radical scavenging 
ability. IL-6 assay results show that both L-OH-CATH30 and 
D-OH-CATH30 promoted the release of IL-6 no matter in vitro 
or in vivo, which varied in different tissue and treatment doses. 
It indicated that this peptide could stimulate the immune system 
to protect against infections in vitro or in vivo. Furthermore, 
L-OH-CATH30 and D-OH-CATH30 also showed differences 
in bactericidal activity against a range of clinical isolates, 
suggesting that OH-CATH30 containing L- residues is more 
effective against some pathogens than OH-CATH30 containing 
D- residues. 
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Figure 4 IL-6 induced by L- and D- OH-CATH30 peptides in vitro 
ELISA was used for IL-6 determination. *: P«0.05. 
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Figure 5 IL-6 induced by L- and D- OH-CATH30 peptides in vivo 


Mouse was intraperitoneal injected 2 mL saline 5 h after they were intraperitoneal injected 10 mg/kg and 5 mg/kg the above peptides. ***: P«0.001. 
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ABSTRACT 


The Shimba Hills ecosystem along the south coast 
of Kenya is a key East African biodiversity hotspot. 
Historically, it is biogeographically assignable to the 
East African coastal biome. We examined the 
current Shimba Hills herpetofauna and their 
zoogeographical affinities to the coastal forests and 
nearby Eastern Arc Mountains biodiversity hotspots. 
The key studied sites included the Shimba Hills 
National Reserve, forest reserves, Kaya forests, and 
adjacent private land. Data on  herpetofaunal 
richness were obtained from recent field surveys, 
literature, and specimens held at the National 
Museums of Kenya, Herpetology Section Collection, 
Nairobi. The Makadara, Mwele, and Longo- 
Mwagandi forests within the Shimba Hills National 
Reserve hosted the highest number of unique and 
rare species. Generally, the forest reserves and 
Kaya forests were important refuges for forest- 
associated species. On private land, Mukurumudzi 
Dam riparian areas were the best amphibian habitat 
and were host to three IUCN (Red List) Endangered- 
EN amphibian species, namely, Boulengerula 
changamwensis, Hyperolius rubrovermiculatus, and 
Afrixalus sylvaticus, as well as one snake species 
Elapsoidea nigra. Using herpetofauna as 
zoogeographic indicators, the Shimba Hills were 
determined to be at a crossroads between the 
coastal forests (13 endemic species) and the 
Eastern Arc Mountains (seven endemic species). 
Most of the Eastern Arc Mountains endemic species 
were from recent records, and thus more are likely to 
be found in the future. This ‘hybrid’ species richness 
pattern is attributable to the hilly topography of the 
Shimba Hills and their proximity to the Indian Ocean. 
This has contributed to the Shimba Hills being the 
richest herpetofauna area in Kenya, with a total of 89 
and 38 reptile and amphibian species, respectively. 
Because of its unique zoogeography, the Shimba 
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Hills ecosystem is undoubtedly a key biodiversity 
area for conservation investment. 


Keywords: Amphibians; Reptiles; Eastern Africa 
coastal forests; Eastern Arc Mountains; Indicator 
species; Zoogeography 


INTRODUCTION 


Biogeography is the study of the geographic distribution of 
organisms, and includes historical, ecological, and conservation 
biogeography. Historical biogeography addresses the causes of 
the past distribution of species, whereas ecological 
biogeography studies the factors (mainly biotic and abiotic) that 
define the current spatial distribution of species (Monge-Nájera, 
2008). Conservation biogeography is the application of 
biogeographical principles, theories, and analyses, especially 
on species distribution to problems concerning biodiversity 
conservation (Whittaker et al., 2005). 

Global biodiversity hotspots are areas that support high 
species richness and endemism, and face considerable threats 
relative to the remaining area (Mittermeier et al., 2005; Myers et 
al., 2000, 2003). Currently, there are 35 global biodiversity 
hotspots, with two represented in Kenya (Sloan et al., 2014). 
One is the Eastern Afromontane hotspot represented by the 
Taita Hills plus mountain and highland areas in central (Mt. 
Kenya, Aberdare Range, and Nyambene Hills) and western 
(Kakamega Forest, Nandi Forests, Cherangani Hills, and Mau 
Hills) Kenya. The other is the coastal forests of eastern Africa 
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represented by the coastal forests of Kenya, with the two most 
well-known being the Arabuko-Sokoke Forest and Shimba Hills. 

Historically, the Shimba Hills have been categorized as a 
coastal forest (Burgess & Muir, 1994; Howell, 1993) and are 
one of the richest herpetological areas in Kenya. Other rich 
herpetofaunal areas include the Arabuko-Sokoke Forest (Chira, 
1993; Drewes, 1992); Kakamega Forest (Wagner et al., 2008; 
Lótters et al., 2007;; Wagner & Böhme, 2007; Schick et al., 
2005), and Taita Hills (Malonza & Veith, 2012; Malonza et al., 
2010; Burgess et al., 2007; Malonza, 2008; Newmark, 2002; 
Lovett, 1990). 

The Shimba Hills ecosystem has diverse habitat types that 
include forests, woodlands, grasslands, bushlands, and 
wetlands (rivers, swamps, and dams). This habitat diversity 
relates to the habitat heterogeneity hypothesis, which suggests 
that structurally complex habitats provide more niches and 
diverse ways of exploiting environmental resources and hence 
increase species diversity (Tews et al., 2004). In the Shimba 
Hills ecosystem, some of the key amphibian and reptile habitat 
areas are manmade dams such as Mukurumudzi. In addition to 
the gazetted forest reserves are the sacred Mijikenda 
community Kaya forests, which vary from small to large and are 
key sites that act as refuges for unique and rare biodiversity 
within community land  (Githitho, 2003). The flagship 
herpetofaunal species for the Shimba Hills ecosystem are the 
endemic Shimba Hills reed frog Hyperolius rubrovermiculatus 
Schiøtz, 1975, and Afrixalus sylvaticus (Schiøtz, 1974), which 
are listed as Endangered (EN) in the IUCN Red List. 

Past herpetological museum collections in the Shimba Hills 
have given an indication that this area might be a unique site, 
recording species both endemic to lowland coastal forests as 
well as the Eastern Arc Mountain forests. No past biodiversity 
work has assessed the exact biogeographical assignment of 
the Shimba Hills because it has been assumed to be a typical 
coastal forest. Recently, however, Bwong et al. (2014) 
examined some of the area's amphibians and hinted that the 
Shimba Hills National Reserve might be at a biogeographical 
crossroads. 

Therefore, the aim of this paper was to assess the amphibian 
and reptile fauna of the Shimba Hills ecosystem, with a focus 
on their zoogeographical affinities. Herpetofauna are good 
zoogeographical indicators due to their low dispersal abilities 
compared with higher vertebrates like birds and mammals (Bell 
& Donnelly, 2006; Pineda & Halffter, 2004). We used 
herpetofauna to test the hypothesis that the Shimba Hills 
ecosystem is a coastal forest. 


MATERIALS AND METHODS 


Study area 

The Shimba Hills ecosystem occurs along the south coast of 
Kenya in Kwale County (Figure 1). The ecosystem is diverse 
and dotted with important biodiversity sites ranging from forests, 
bushlands, and grasslands to wetlands, and all at varying levels 
of habitat disturbance. These sites occur from sea level up to 
about 430 m in hills within the Shimba Hills National Reserve. 
The Shimba Hills ecosystem includes protected areas such as 
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the Shimba Hills National Reserve, Buda Forest, Gongoni 
Forest, Mwaluganje Community Wildlife Conservancy, Kaya 
forests, community lands, and private dams such as 
Mukurumudzi. 


Herpetofaunal sampling 

Field sampling was conducted in different sites covering 
varying seasons from July 2012 to June 2017. Each sampling 
visit was done for two weeks covering different sites. The 
periods and frequencies of sampling visits are provided below: 


1. July/August 2012 (dry season) Mukurumudzi River Dam 
site (Miembeni, Nguluku, and Maumba); 

2. November/December 2012 (wet season) Mukurumudzi 
River Dam site (Miembeni, Nguluku, and Maumba); 

3. April/May 2013 (dry/wet season) Mukurumudzi River 
Dam site (Miembeni, Nguluku, Maumba, and dam-site); 

4. June 2013 (wet season) Mukurumudzi Dam (Miembeni, 
Nguluku, Maumba, and dam-site), Mtawa stream, and 
Gongoni Forest pond; 

5. September 2013 (dry season) Mukurumudzi Dam 
(Miembeni, Nguluku, Maumba, and dam-site); 

6. November/December 2013 (wet season) Mukurumudzi 
Dam (Miembeni, Nguluku, Maumba, dam-site, and 
tailings storage facility-Vumbu), Gongoni Forest, and 
Gongoni Forest pond; 

7. March/April 2014 (wet season) Mukurumudzi Dam 
(Miembeni, Nguluku, Maumba, dam-site, and tailings 
storage facility-Vumbu), Gongoni Forest, and Gongoni 
Forest pond; 

8. June/July 2014 (wet season) Mukurumudzi Dam 
(Miembeni, Nguluku, Maumba, dam-site, and tailings 
storage facility-Vumbu); 

9. September 2014 (dry season) Mukurumudzi Dam 
(Miembeni, Nguluku, Maumba, dam-site, and tailings 
storage facility-Vumbu); 

10. November/December 2014 (wet season) Mukurumudzi 
Dam (Miembeni, Nguluku, Maumba, dam-site, and 
tailings storage facility- Vumbu); 

11. March/April 2015 (dry season) Mukurumudzi Dam 
(Miembeni, Nguluku, Maumba, dam-site, and tailings 
storage facility-Vumbu); 

12. May 2015 (wet season) Shimba Hills National Reserve 
(Longo Forest, Makadara Forest, Mwele Forest, Kaya 
Kwale Forest, Shimba Lodge wetland, Mwadabara 
bushland, Mwadabara wetland, Kivumoni Gate wetland, 
and Sheldrick Falls wetland), Gongoni Forest, Gongoni 
pond, Buda Forest, Mwabanda community land wetland, 
and Mukurumudzi Dam (Miembeni, Nguluku, Maumba, 
dam-site, and tailings storage facility-Vumbu); 

13. August/September 2015 (wet season) Shimba Hills 
National Reserve (Longo Forest, Makadara Forest, 
Shimba Lodge wetland, and Mwadabara wetland), 
Gongoni Forest, Gongoni pond, Buda Forest, and 
Mukurumudzi Dam (Miembeni, Nguluku, Maumba, dam- 
site, and tailings storage facility- Vumbu); 
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Figure 1 Map of Kwale showing the Shimba Hills National Reserve, Buda, Gongoni and other protected forest reserves (Drawn by Edwin 
N. Gichohi) 
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November 2015 (wet season) Shimba Hills National 
Reserve (Longo Forest, Makadara Forest, Kaya Kwale 
Forest, and Shimba Lodge wetland), Gongoni Forest, 
Gongoni pond, Buda Forest, and Mukurumudzi Dam 
(Miembeni, Nguluku, Maumba, dam-site, and tailings 
storage facility-Vumbu); 

March 2016 (dry season) Shimba Hills National Reserve 
(Longo Forest, Makadara Forest, and Shimba Lodge 
wetland), Gongoni Forest, Buda Forest, Mukurumudzi 
Dam (Miembeni, Maumba, Nguluku, dam-site, and 
tailings storage facility- Vumbu), Kaya forests (Kinondo, 
Ukunda, Tiwi, Muhaka, Waa, Sega, Lunguma, Chonyi, 
and Gandini); 

May/June 2016 (wet season) Shimba Hills National 
Reserve (Longo Forest, Makadara Forest, Shimba 
Lodge wetland, and Mwadabara wetland), Gongoni 
Forest, Gongoni pond, Buda Forest, Mukurumudzi Dam 
(Miembeni, Maumba, dam-site, tailings storage facility- 
Vumbu), and Kaya forests (Kinondo, Muhaka, and 
Gandini); 

December 2016 (dry season) Shimba Hills National 
Reserve (Longo Forest, Makadara Forest, and Shimba 


18. 


Lodge wetland), Gongoni Forest, Mukurumudzi Dam 
(Miembeni, Maumba, dam-site, and tailings storage 
facility-Vumbu), and Kaya forests (Kinondo and 
Muhaka); 

June 2017 (wet season) Mukurumudzi Dam (Miembeni, 
Maumba, dam-site, and tailings storage facility-Vumbu), 
Shimba Hills National Reserve (Longo Forest, Makadara 
Forest, Mwele Forest, Shimba Lodge wetland, 
Mwadabara wetland, and Mwele Forest stream), 
Gongoni Forest, Gongoni pond, and Kaya forests 
(Kinondo and Muhaka). 


Two standard methods (timed-species count and pitfall 
trapping) were used during field sampling. 


(i) Time-limited searching (TLS): This is a form of timed- 
species counting. Here, sampling was conducted for a limited 
time period of two hours during the day and one hour at night 
within a given selected habitat without setting boundaries using 


three 


observers. The procedure is similar to the time 


constrained search-and-seize method described previously 
(Heyer et al., 1994; Karns, 1986). It involves quietly and slowly 
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walking and recording all individual reptiles and amphibians 
found in different micro-habitats such as on the ground, in 
debris, under decomposing logs, on trees, shrubs, grass, reeds, 
and sedges, for one-person hour. It also involves digging in 
suitable micro-habitats in search of burrowing species such as 
skinks, caecilians, frogs, and snakes. For reptiles, this was 
done mostly in the morning session when they are most active, 
and can be observed basking, resting, or foraging. Night 
searches for amphibians involved visiting various selected 
wetlands (river beds/edges, ponds, pools, and marshlands) for 
one hour from 19:00hrs. In wetlands at night we used both 
visual and acoustic techniques to record frog species. 

(ii) Pitfall traps associated with drift fence: An X-shaped drift 
fence with pitfall traps, consisting of 5-m long segments, was 
used in this study (Malonza et al., 2011). This is a modification 
of the single array design used by Corn (1994). A drift fence (30 
cm high) was stapled vertically onto wooden stakes or pegs. 
The pitfall traps consisted of 10-L plastic buckets flush to the 
ground, with every trap array having five buckets in total. The 
traps were normally opened and checked for five nights and 
were mostly used for detection of small, primarily nocturnal, 
crawling herpetofauna not easily detected through other 
methods. This method was only used in four sites within 
Mukurumudzi Dam. 

(iii) Opportunistic encounters: This qualitative method 
involved recording all species found or encountered 
opportunistically for every site outside of the ordinary sampling 
period of the two standard methods mentioned above. Data 
from other surveys outside the systematic surveys were also 
included. 


Shimba Hills National Reserve 

The Shimba Hills National Reserve is the main protected area 
in this ecosystem and is comprised of different habitat types, 
including forests, grasslands, woodlands, bushlands, and 
wetlands (rivers and swamps). It is managed by the Kenya 
Wildlife Service (KWS) in collaboration with the Kenya Forest 
Service (KFS). The following sites were sampled during this 
study: Makadara Forest (S04°14.253'; E039723.742'" 426 m 
a.s.l.); Longo-Mwagandi Forest (S04^14.264 E039 25.566" 
398 m); Mwele Forest (S04°17.198'; E039°21.766'; 334 m); 
Kaya Kwale Forest (S04°10.888', E039°26.737', 417 m); Mwele 
stream spring (S04°16.323', E039 22.214"; 329 m); Shimba Hills 
Lodge wetlands (S04°11.559', E039°25.555'; 290 mj 
Mwandabara 1 water pump station roadside wetlands 
(80410.854" E039725.158; 159 m); and Sheldrick Falls 
(804717.098 E039 25.849", 146 m). 


Forest reserves 

The forest reserves are managed by the Kenya Forest Service 
(KFS), formerly the Forest Department (FD), which allows local 
communities to harvest sustainably some forest products, such 
as dead fuelwood. The following sites were sampled during this 
study: Gongoni Forest (S04°24.176', E03927.604 42 m); 
Gongoni Forest pond (S04°25.778', E03927.755'; 24 m); and 
Buda Forest (S04° 25.734', E039" 24.292"; 100 m). 
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Kaya forests 

Kayas are sacred sites that owe their continued protection to 
the long-standing and environmentally friendly Mijikenda 
community traditions, taboos, beliefs, and cultures. The 
Mijikenda (nine sub-tribes) includes Giriama, Chonyi, Rabai, 
Ribe, Kauma, Kambe, and Jibana of Kilifi County along the 
north coast of Mombasa plus Digo and Duruma of Kwale 
County along the south coast. These traditions regulate access 
and conduct within these forests, threatening dire punishment 
from the spirit world for those who flout the rules (Githitho, 
2003). The Kayas occur on hill tops, ridges, valleys, or along 
coastal beach fronts. Because of their cultural values to the 
community, most Kayas are protected as monuments under the 
National Museums and Heritage Act, 2006, and placed under 
the management of the National Museums of Kenya (NMK). In 
addition, some are protected as forest reserves and managed 
by the KFS, but all are managed by the local community Kaya 
elders in collaboration with either the NMK or KFS. In addition, 
several Kayas, namely Kaya Giriama (Fungo), Rabai Kayas 
(Mudzimuvya, Bomu, and Fimboni), Kaya Kambe, Kaya Jibana, 
Kaya Ribe, Kaya Kauma (all in Kilifi), and Duruma Kayas 
(Gandini and Mtswakara) in Kwale were listed as World 
Heritage sites by United Nations Educational Scientific and 
Cultural Organizatio (UNESCO) in 2008. This was based on 
their roles as living legacies of the local people's history, culture, 
and religion. Currently, however, many Kayas are now highly 
degraded because the youthful generations disregard these 
traditions. The following are the Kwale Kaya forests sampled in 
this study (only during the day): Kaya Kinondo (S04°23.588'; 
E039°32.810'; 8 m); Kaya Ukunda (S04°18.783'; E039°34.047'; 
26 m); Kaya Tiwi (S04715.280'; E039°35.824', 20 m); Kaya 
Muhaka (S04°%19.941'; E03931.054; 50 m) Kaya Waa 
(80412.131 E039 36.827"; 25 m); Kaya Sega (S04°033.254’; 
E039°04.745'; 74 my Kaya Lunguma  (S04707.570* 
E039 31.198 132 m); Kaya Chonyi (S04°03.962'; E039° 
31.777'; 52 m); and Kaya Gandini (S04° 02.163'; E039" 30.426"; 
153 m). 


Mukurumudzi Dam 

The Mukurumudzi Dam area was initially a human settlement 
along the Mukurumudzi River just below Shimba Hills town. The 
residents were resettled to pave way for mineral sands mining 
and subsequent habitat restoration by Base Titanium Limited. 
Construction of the dam earth embankment started in 2011, 
with the dam undergoing initial filling in May 2013. Pre-dam 
herpetological monitoring in four sites started in July 2012 and 
continued with post-dam monitoring from June 2013 to June 
2017. After the dam was filled, the monitoring sites were divided 
into upstream and downstream sites. The sites selected were 
those with substantial remnant indigenous forest patches, 
assumed to act as species refuges for the once expansive 
coastal forests. In addition, due to their location at the edge of 
the dam, all sites have wetlands for night frog sampling. The 
major aim was to determine the impacts of the dam on reptile 
and amphibian composition over time. The following areas were 
sampled in this study: Upstream sites: Miembeni (80422.512', 
E039°25.529', 56 m); Maumba (S04'23.520', E039°25.774'; 82 m); 


and Nguluku (S04°23.967', E039°25.360'; 87 m); Downstream 
sites: dam-site (S04°24.444', E039°26.021'; 37m); and tailings 
storage facility- Vumbu (S04°23.846', E039°27.057'; 43 m). 

Community sites sampled: The Mwabanda wetland, which is 
a wetland within a rice field stream valley (S0426.499', 
E03926.399" 32 m), and Mtawa stream (S04°21.217', 
E039°28.125; 74 m) were sampled in this study. 


Species data 
Data on species presence were acquired from literature and 
past and recent fieldwork, as well as early specimen collections 
housed at the Herpetology Section of the NMK since 1960s. 
Collected representative voucher specimens were 
euthanized in a humane manner according to standard 
protocols. Amphibian specimens were euthanized with MS222 
solution and reptiles with pentobarbital solution and then 
preserved in 10% formalin and stored in 70% ethanol solution. 
Selected tissue samples were taken and stored in absolute 
ethanol for later molecular analysis. Color photographs of 
selected species and their habitats were taken. A 12 Channel 
Garmin® receiver was used to collect GPS data. The collected 
voucher materials were deposited at the NMK in Nairobi. 


RESULTS 


Species richness 

Based on recent field collections, records, and past museum 
specimens, a total of 89 reptile (47 snakes, 39 lizards, 1 
terrapin, 1 tortoise, and 1 crocodile) and 38 amphibian species 
(36 frogs and 2 caecilians) occur in the Shimba Hills 
ecosystem. In comparison with other herpetofaunal-rich areas 
in Kenya, the Arabuko-Sokoke Forest has a total of 80 reptile 
(46 snakes, 31 lizards, 1 terrapin, and 2 tortoises) and 30 
amphibian species (all frogs), the Kakamega Forest has 61 
reptile (38 snakes, 21 lizards, 1 terrapin, 1 tortoise) and 24 
amphibian species (all frogs), and the Taita Hills have 65 reptile 
(34 snakes, 28 lizards, 1 terrapin, and 2 tortoises) and 26 
amphibian species (24 frogs and 2 caecilians) (Supplementary 
Table 1, available online). 


Species of conservation concern 

We used the IUCN Red List of threatened species to identify 
species of conservation concern. In addition, we selected other 
species with a distribution range restricted to coastal forests 
and the Eastern Arc Mountains of East Africa that were either 
rare and/or associated with indigenous forests. The Makadara, 
Longo-Mwagandi, and Mwele forests in the Shimba Hills 
National Reserve were the specific forest sites where most 
species of conservation concern were recorded. The entire 
species list and distribution in different sites is given in 
Supplementary Table 1 (available online). Makadara Forest 
species: Scolecomorphus vittatus (Boulenger, 1895), single 
specimen from the NMK Herpetology Collection; Callulina 
krefffti (Nieden, 1911), first collected in 1961 (Loader et al., 
2010). Longo-Mwagandi Forest species: Prosymna semifasciata 
(Broadley, 1995), photographed in September 2014; Kinyongia 
tenuis (Matschie, 1892), IUCN Red List Endangered (EN), 


collected at the main gate forest and last collected in July 1981. 
Makadara and Longo-Mwagandi forests: Elapsoidea nigra 
(Günther, 1888), IUCN Red List Endangered (EN), also present 
in Mukurumudzi Dam forest patch; Cnemaspis africana 
(Werner, 1895), not collected in the past; Philothamnus 
macrops (Boulenger, 1895), also present in Mukurumudzi Dam 
(dam-site and Maumba) and Gongoni Forest. Melanoseps 
pygymaeus (Broadley, 2006), present in Longo-Mwangadi, 
Makadara, Buda, Gongoni, and Kaya Kinondo forests. Holaspis 
laevis (Werner, 1895), present in Longo-Mwagandi, Gongoni, 
and Buda forests, an arboreal and agile species first recorded 
and collected in Kenya in October 2009 in the Kaya Jibana 
Forest in Kilifi (Malonza & Bauer, 2014). Lygodactylus broadleyi 
(Pasteur, 1995), present in Longo-Mwagandi, Makadara, 
Gongoni, Buda, Kaya Muhaka, and Kaya Kinondo forests, as 
well as the Miembeni Forest patch in Mukurumudzi Dam. 
Cordylus tropidosternum (Cope, 1869), collected in Makadara 
Forest, Kaya Kwale Forest, and Kaya Gandini Forest. 
Boulengerula changamwensis (Loveridge, 1932), IUCN Red 
List Endangered (EN), a soil burrowing amphibian species 
present in Longo-Mwangandi, Makadara, and Buda forests, as 
well as Mukurumudzi Dam sites (Miembeni and Maumba forest 
patches). Gastropholis prasina (Werner, 1904), present in the 
Shimba Hills National Reserve in the Sheldrick Falls area, and 
recorded at the main gate forest and in Buda Forest station. 
Amphibian species that exhibit water-dependent breeding 
included: Leptopelis grandiceps (Ahl, 1929), IUCN Red List 
Vulnerable (VU), a forest dependent species only present in 
Mwele Forest stream spring; and reed frogs Hyperolius 
rubrovermiculatus (Schiøtz, 1975) and Afrixalus sylvaticus 
(Schiøtz, 1974), both categorized as IUCN Red List 
Endangered (EN) and present in wetlands in the Shimba Hills 
National Reserve (e.g. Shimba Lodge, Mwadabara, Sheldrick 
Falls, and Mwele Forest stream), Gongoni Forest, Buda Forest, 
and Mukurumudzi Dam site wetlands. 

The search continues to verify the status of historical records 
of Gastropholis vittata (Fischer, 1886), Bitis gabonica (Duméril 
& Bibroni, 1845), and Hemidactylus modestus (Günther, 1894) 
in the Shimba Hills National Reserve and surrounding area 
(Spawls et al., 2002). 


Shimba Hills biogeographical assignment 

The Shimba Hills have been historically categorized as part of 
the coastal forests of eastern Africa. However, below is a 
comparison of the species so far recorded in the Shimba Hills 
ecosystem that are endemic to the two different 
biogeographical zones. 

Eastern Arc Mountains species: Currently there are seven 
typical Eastern Arc Mountains herpetofaunal species. However, 
the recorded Eastern Arc Mountain species has been 
increasing steadily in the last several decades. Initially, the 
species known from the Shimba Hills National Reserve only 
included Kinyongia tenuis (last collected in August 1981). 
Recent collections of this species and records of other Eastern 
Arc endemics such as the snakes Elapsoidea nigra and 
Prosymna semifasciata, caecilian Scolecomorphus vittatus, and 
frogs Callulina kreffti and Leptopelis grandiceps have been 
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made. In addition, there is also an early collection of a pygmy 
chameleon assignable to Rhampholeon temporalis (Matschie, 
1892), but it has yet to be re-collected. 

Coastal forest species: Including the endemic Hyperolius 
rubrovermiculatus and Afrixalus sylvaticus and Rieppeleon 
brevicaudatus (Matschie, 1892) species, 13 East African 
coastal forest endemic species have been recorded in the 
Shimba Hills ecosystem. These include Hemidactylus barbouri 
(Loveridge, 1942), Hemidactylus mrimaensis (Malonza & 
Bauer, 2014), Lygodactylus mombasicus (Loveridge, 1935), 
Melanoseps pygmaeus, Lygosoma pembanum (Boettger, 1913), 
Lygodactylus broadleyi (Pasteur, 1995), Letheobia swahilica 
(Broadley & Wallach, 2007), Letheobia lumbriciformis (Peters, 
1874), Leptotyphlops macrops (Broadley & Wallach, 1996), and 
Boulengerula changamwensis. This biogeographical analysis of 
the Eastern Arc and coastal forests of the Shimba Hills species 
gives a ratio of 7:13. 


DISCUSSION 


The Shimba Hills stand out as the most reptile and amphibian 
species rich area in Kenya, particularly due to the endemic and 
near-endemic species. Undoubtedly, the Makadara, Longo- 
Mwagandi, and Mwele forests within the Shimba Hills National 
Reserve are key forest areas because they support all currently 
known rare and unique herpetofaunal species in this 
ecosystem. Outside the Shimba Hills National Reserve, 
Mukurumudzi Dam is the best amphibian habitat. Following the 
creation of the dam, the ongoing habitat rehabilitation and 
protection efforts have created new habitats for species, while 
former areas have improved in quality. This agrees with other 
studies elsewhere on the effects of large water dams on 
biodiversity (McAllister et al., 2000). Although the Gongoni, 
Buda, and Kaya forests are highly disturbed and degraded, 
they also support some unique forest-associated species. This 
concurs with past findings in the Kitobo Forest in south-eastern 
Kenya, which was found to act as a refuge for several 
threatened forest species despite being surrounded by 
intensive community settlement and agriculture (Malonza et al., 
2011). 

Using reptiles and amphibians as zoogeographical indicators, 
there was no clear support of the hypothesis that the Shimba 
Hills area is biogeographically a typical coastal forest. Our 
results showed that the Shimba Hills are at a biogeographical 
crossroads, supporting a good number of typically Eastern Arc 
as well as East African coastal forest species (Burgess et al., 
2007). This observation was also noted in a brief analysis of the 
Shimba Hills amphibians by Bwong et al. (2014). Given that 
most of the Eastern Arc endemics are rare and recent species 
records, it is likely that their numbers might increase with 
additional sampling. In addition, further analysis of all forms of 
biodiversity (flora and fauna), including molecular taxonomy, is 
needed to clearly understand the biogeographic history of the 
Shimba Hills biodiversity. 

With their low undulating hills and plateaus, unlike the 
ordinary coastal lowland forests but like the Eastern Arc 
Mountains, the Shimba Hills present the first break or barrier to 
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the moist rainfall winds from the Indian Ocean maintaining a 
stable montane forest environment. The proximity of the 
Shimba Hills to the Indian Ocean has resulted in a stable 
climate, leading to its diverse and unique herpetofauna. The 
hilly topography (up to ~430 m) differentiates the Shimba Hills 
from typical lowland coastal forests like Arabuko-Sokoke. 
Furthermore, although the humid montane forests are like 
islands that promote speciation similar to that observed in the 
Eastern Arc Mountains, most of the species found in the 
Shimba Hills occur at much lower elevations than typically 
recorded elsewhere in their range. For example, the Usambara 
garter snake (Elapsiodea nigra) has been recorded below 100 
m in the Mukurumudzi Dam basin, whereas in Tanzania it is 
known from 300-2 000 m (Menegon et al., 2008; Spawls et al., 
2002). 

Biogeographically, the Shimba Hills are an important reptile 
and amphibian area supporting unique and diverse species of 
conservation concern. While substantial work has been covered 
in this study, more extensive and intensive fieldwork using all 
standard herpetofaunal sampling methods covering different 
seasons and habitat types is highly recommended. 
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ABSTRACT 


Due to a paucity of surveys in northern Indochina 
and lack of international collaborations among 
neighboring countries, recognized  distributional 
ranges for many amphibian and reptile species end 
at the political borders for some countries, despite 
seemingly continuous suitable habitat spanning the 
region. Combining both morphological and genetic 
data, we report the first discovery of Japalura 
chapaensis, a rare agamid lizard believed previously 
to be endemic to northern Vietnam only, along the 
border region of southeastern Yunnan Province, 
China. To facilitate future research on the genus 
Japalura sensu lato in Indochina, we provide 
detailed descriptions of additional specimens of this 
rare species, including the first description of 
coloration in life and an expanded diagnosis, and 
discuss the species boundary of J. chapaensis with 
respect to its congeners. 


Keywords: Draconinae; Indochina; Morphological 
Variation; New record; Range extension; Systematics; 
Taxonomy 


INTRODUCTION 


Species with distributions spanning the political jurisdictions of 
multiple countries pose special challenges to biodiversity and 
conservation studies. Not only are conservation assessments 
and protection of this class of evolutionary lineages more 
difficult, but also, variation in the understanding of regional 
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biodiversity can lead to conflicting or questionable taxonomic 
and biogeographic patterns for widespread species. This 
situation has been particularly pronounced in peripheral island 
systems, including the Philippines (Esselstyn et al., 2004; 
Esselstyn & Oliveros, 2010; Siler et al., 2014), eastern 
Indonesia (Taylor, 2003), and Hawaii (Gillespie et al., 2012), 
among others (Gillespie & Claque, 2009). However, even for 
species with broad geographic distributions on continental 
landmasses, the paucity of communication across political 
boundaries has resulted in species being recognized as 
restricted to one side of a country's border or the other, despite 
evidence of continuous distributions spanning the boundaries of 
multiple countries (Dudgeon et al., 2006; Hannah et al., 2002). 
This is exemplified in regions of Southeast Asia, particularly 
northern Indochina. 

The border between southern China and northern Vietnam 
has posed problems for studies of amphibian and reptile 
diversity. Historically, biodiversity research on the herpetofauna 
across this region has been conducted independently in China 
and Vietnam, with interpretation of research often slowed by 
language barriers and lack of regional collaboration (Ananjeva 
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et al., 2007; Bain et al., 2009; Yang & Rao, 2008; Zhao et al., 
1999). However, despite the similar habitat structures across 
this region, and evidence of continuous distributions for 
ecologically similar species of frogs and lizards (Bain et al., 
2009; Ota, 2000), much of the herpetofauna is still thought to be 
endemic to one side of the political border or the other. An 
example of this is the Chapa Mountain Dragon, Japalura 
chapaensis Bourret, 1937. 

First described as a subspecies of Japalura swinhonis 
Günther, 1864, the description of J. chapaensis was based 
originally on a single female specimen (MNHN 1948.45) from 
Chapa (now "Sa Pa") in Lao Cai Province, northern Vietnam, 
close to the China-Vietnam border (Bourret, 1937). Ota (1989) 
later elevated J. chapaensis to full species based on 
examination of morphological characters, and three years later, 
Ota & Weidenhófer (1992) described the first and only male 
specimen of the species (KUZ 20097) from the type locality. 
However, despite some descriptive comparisons to morphologically 
similar species, low sample size (two specimens of J. 
chapaensis, both with incomplete tails reported) and limited 
diagnostic analysis against other congeners in the region, has 
limited our understanding of the phenotypic variation and 


diagnostic features of J. chapaensis (Ota, 1989; Ota & 
Weidenhófer, 1992). Currently, Japalura chapaensis is known 
from only two voucher specimens only, with distribution records 
limited to the eastern slopes of the Hoang Lien Son mountain 
ridge in northern Vietnam (Ananjeva et al., 2007; Bourret, 1937; 
Cai et al, 2015; Manthey, 2010; Van Sang et al., 2009). 
However, as the Hoang Lien Son Mountains continue and 
extend to the southern part of the Yunnan Province in China, 
thus offering similar habitats in close geographic proximity to 
the type locality of J. chapaensis in northern Vietnam, it remains 
possible that J. chapaensis also occurs across the border in the 
southern edge of China. 

In November 2015, we collected three agamid lizards from 
Lvchun, Honghe Prefecture, Southern Yunnan, China, close to 
the China-Vietnam border (Figure 1). Based on newly collected 
genetic and morphological datasets, we confirmed the identity 
of these Chinese specimens as J. chapaensis. Here, we extend 
the recognized distribution of J. chapaensis into China, provide 
a description of additional specimens of this rare species, and 
discuss morphological variation of J. chapaensis with relation to 
mainland congeners, particularly J. yunnanensis. 





Figure 1 Distribution of Japalura sensu lato in northern Indochina 


Stars indicate type localities for J. yunnanensis (A: Longling, Baoshan, Yunnan Province, China) and J. chapaensis (B: Sa Pa, Lào Cai Province, 
Vietnam); circle shows new locality of J. chapaensis (C: Lvchun, southern Yunnan, China). 


MATERIALS AND METHODS 


Field sampling 

A total of three specimens of J. chapaensis (an adult male, an 
adult female, and a sub-adult female) were collected from 
Lvchun, Honghe Prefecture, southern Yunnan Province, China 
on 10 November 2015 (N22.994 1°, E102.398 9°, 1 550 m 
elevation, WGS 84), and a topotypic newborn was collected 
from the vicinity of Tram Don mountain pass, Hoang Lien Son 
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National Park, Sa Pa, Lao Cai Province, Vietnam (GPS 
N22.353 1°, E103.774 9°, 1 950 m elevation, WGS 84; Figure 
1). After euthanasia, tissue samples were taken from the liver 
and preserved in 9596 ethanol, and the voucher specimens 
were fixed in 1096 buffered formalin and transferred to 7096 
ethanol after fieldwork. All Chinese specimens were deposited 
at the Museum of Kunming Institute of Zoology, Chinese 
Academy of Sciences, and the Vietnamese specimen was 
deposited at the Zoological Museum of Lomonosov Moscow 
State University, Moscow, Russia. 


Morphometrics 

We examined all morphological characters used in previous 
descriptions of J. chapaensis (Bourret, 1937; Ota, 1989; Ota & 
Weidenhófer, 1992), as well as additional characters following 
Wang et al. (2016). Morphometric data were recorded by JK 
using a digital caliper to the nearest 0.01 mm, except for tail 
length, which was recorded using a ruler to the nearest 1 mm. 
The following morphological characters were examined: 
snout-vent length (SVL), tail length (TAL), trunk length (TRL), 
head width (HW), snout-eye length (SEL), interorbital 
distance (IOD), fore-limb length (FLL), hind limb length (HLL), 
Toe IV length (T4L), Finger IV subdigital lamellae count (F4S), 
supralabial count (SL), infralabial count (IL), middorsal scale 
count (MD), Toe IV subdigital lamellae count (TAS), number of 
scales between nasal and first supralabial (NSL), supraciliary 
count (SCL), number of scale rows between the sixth supralabial 
and orbit circle (SOR), enlarged, conical, occipital scale count 
(COS), enlarged, conical, post-tympanic scale count (PTY), and 
enlarged, conical, postrictal scale count (PRS). 

Summaries of specimens examined are presented in 
Appendix. For comparisons with other phenotypically similar 
species, morphological data were collected from type or 
topotype specimens when available. In addition to the vouchered 
specimens examined, morphological data of congeners were 
obtained from literature (Manthey et al, 2012; Ota & 
Weidenhófer, 1992). For maximum comparability and 
consistency, color description terminology followed Köhler 
(2012). Museum abbreviations followed Sabaj (2016), including: 
Chengdu Institute of Biology, Chinese Academy of Sciences 
(CIB); Kunming Institute of Zoology, Chinese Academy of 
Sciences, Kunming, China (KIZ); Department of Zoology, Kyoto 
University, Kyoto, Japan (KUZ); National Museum of Natural 


History, Washington D.C., USA (NMNH) Museum of 
Comparative Zoology, Harvard University, Cambridge, USA 
(MCZ); California Academy of Sciences, San Francisco, USA 
(CAS); National Museum of France, Paris, France (MHNP); and 
Zoological Museum of Lomonosov Moscow State University, 
Moscow, Russia (ZMMU). 


DNA sequencing and genetic divergences 

We extracted total genomic DNA from the newly collected 
tissues of J. chapaensis from China (KIZ 034923 and KIZ 
034921), topotypic J. chapaensis (ZMMU NAP-01911), and 
topotypic J. yunnanensis (CAS 242271), using a guanidine 
thiocyanate extraction protocol. A fragment of 432 bp of the 
mitochondrial NADH dehydrogenase subunit 2 (ND2) gene was 
targeted and amplified using the primers and protocols of 
Macey et al. (2000). Previously published sequence data ND2 
for congeners J. splendida and J. flaviceps were obtained from 
GenBank (Accession Nos.: AF128500, AF128501). Novel 
sequences were deposited in GenBank (Accession Nos.: 
MG214260-MG214264). Uncorrected genetic distances were 
calculated using PAUP v. 4.0 (Swofford, 2002). 


RESULTS AND DISCUSSION 


Genetic distance and morphological characteristics of the 
southern Yunnan population 

For the ND2 fragment analyzed, the two individuals from 
southeastern Yunnan Province are genetically identical to each 
other. The southern Yunnan individuals are 1.8% divergent from 
the topotypic J. chapaensis, 5.5% from the topotypic J. 
yunnanensis, 14.896 from J. splendida, and 18.296 from J. 
flaviceps (Table 1). 


Table 1 Uncorrected genetic distances among members of sampled Japalura sensu lato using a fragment of 432 bp ND2 gene 





Japalura Japalura Japalura Japalura chapaensis ` Japalura chapaensis 
splendida flaviceps yunnanensis (topotype) (southern Yunnan) 
Japalura splendida - 
Japalura flaviceps 0.14977 - 
Japalura yunnanensis 0.18433 0.16359 - 
Japalura chapaensis (topotype) 0.17972 0.15668 0.0576 - 
Japalura chapaensis (southern Yunnan) 0.18203 0.14747 0.0553 0.01843 - 


GenBank accession Nos. are listed in the methods. 


Detailed morphometric and pholidosis characteristics are 
presented in Table 2. Morphological characteristics of the 
individuals from southern Yunnan resemble the diagnoses of 
J. chapaensis proposed by Ota (1992) (characteristics of the 
type and topotype from Ota [1992] are given in 
parentheses), SL 7 or 8 (7 or 8), IL 7-9 (7), MD 35-41 (35- 
37) and T4S 27-30 (28-30). Several morphological 
characteristics measured for the newly discovered 
population fall outside the currently recognized range of J. 
chapaensis, including relative length of Toe IV T4L/SVL 
21.31%—-22.35% (23.7%-26.0%), relative fore-limb length 


FLL/SVL 48.14%-50.79% (vs. 50.3%-54.7%), and relative 
hind limb length HLL/SVL 74.66%-77.81% (vs. 80.9%- 
81.4%). However, all characters closely match those of Ota 
(1992), with the same level of intraspecific phenotypic 
variation observed for other lineages in the genus Japalura 
sensu lato (Ota, 2000; Wang et al., 2017). With only two 
specimens known previously for J. chapaensis, it is 
expected that distinct populations of the species would 
possess some degree of variation among phenotypic 
characters. 
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Table 2 Morphological and pholidosis characteristics of Japalura chapaensis 





Catalog No. KIZ 034922 KIZ 034921 KIZ034923 MHNP 1948.45 KUZ 20097 
Sex M F F (subadult) F M 

Type Status - - - Holotype Topotype 
SVL 67.10 68.50 50.80 59.60 58.10 
TAL 168 157 109 122 (?) / 

HL 21.00 22.80 17.20 19.30 19.40 
HW 13.80 14.40 11.30 / / 

HD 12.80 12.70 9.60 / / 

SEL 8.60 8.80 6.60 8.80 8.70 
IOD 12.50 10.70 9.70 11.60 10.00 
FLL 32.30 33.80 25.80 30.00 31.80 
HLL 50.10 53.30 38.50 48.00 47.30 
T4L 15.00 14.60 10.90 14.10 15.10 
TRL 30.90 29.70 23.40 27.20 28.20 
TAL/SVL 249.63% 228.61% 214.37% 204.70% / 
HL/SVL 31.30% 33.28% 33.86% 32.38% 33.39% 
HW/HL 65.71% 63.16% 65.70% / / 
SEL/HL 40.95% 38.60% 38.37% 45.60% 44.85% 
FLL/SVL 48.14% 49.34% 50.79% 50.34% 54.73% 
HLL/SVL 74.66% 77.81% 75.79% 80.54% 81.41% 
TRL/SVL 46.05% 43.36% 46.06% 45.64% 48.54% 
SL 8/8 7/7 8/7 7 8 

IL 8/8 8/9 8/7 7 7 

SOR 3/3 3/3 3/3 3 / 

NSL 0 0 0 0 0 

MD 35 41 36 35 37 

F4S 23/23 25/26 23/24 22/23 24/22 
TAS 27/28 28/29 30/30 27/28 30/30 
PTS 2/2 2/1 2/2 / / 

PTY 3/3 3/2 3/2 / / 

PRS 6/7 9/7 10/8 / / 


Abbreviations are listed in the methods. Data of the holotype and topotype male were obtained from literature (Bourret, 1937; Ota 1989, 1992). “/” 
indicates missing data from the literature, and "—" indicates an incomplete tail. Tail of the holotype was recorded as complete, with its length noted in the 
original description; but it was shown to be incomplete in re-descriptions by Ota (1989, 1992). 


Based on genetic and morphological data, we feel confident in 
the identity of the southern Yunnan population of Japalura as J. 
chapaensis; a new record for agamid lizards of China. To facilitate 
future study of this rare forest agamid and closely related species, 
we provide a detailed description of the newly collected individuals. 


Description of specimens from Yunnan (Figure 2) 

KIZ 034922, adult male; KIZ 034921, adult female; KIZ 034923, 
subadult female; all collected by Yufan Wang from Lvchun, 
southern Yunnan, China. 
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Description of southern Yunnan population.—Body size 
moderate, SVL 67.1 mm in adult male, 68.5 mm in adult female. 
Tail slender, long, TAL/SVL 249.63% in adult male, 228.6196 in 
adult female, 214.37% in subadult female. Relative head length 
moderate, HL/SVL 31.30% in male, 33.28% in female, 33.86% 
in subadult female; HW/HL 65.71% in male, 63.16% in female, 
65.7096 in subadult female. Snout blunt, rounded; rostral 
rectangular, three times longer than height; nasal large, 
somewhat oval in shape, separated from rostral by single scale, 
in contact with supralabials; SL 7 or 8, elongated, weakly keeled; 
three suborbital scale rows from inferior orbit circle to sixth 





Figure 2 Photographs of (1) dorsal body view, (2) ventral body view, and (3) lateral head close-ups of male (A1—3; KIZ 034922) and female 
(B1-3; KIZ 034921) Japalura chapaensis in life (Photographs by Ke Jiang) 


supralabial, middle row much enlarged; supraciliaries 6 or 7, 
imbricate, overlapping one-fifth to one-third of posterior one; 
numerous large, keeled scales forming single ridge from 
postorbital to  supra-anterior tympanum on each side; 
tympanum concealed, covered by fine scales; two or three 
large conical scales posterior to tympanum. Mental pentagonal. 
IL 7-9, smooth; numerous large, conical postrictal scales 
present. Dorsal head scales heterogeneous in size and shape, 
strongly keeled; three large, hexagonal, conical scales arranged 
in triangular position on dorsal snout; interparietal scale 
elongate, hexagonal, parietal eye present; two pairs of distinct, 
conical scales present on raised areas posterior to interparietal 
on each side, forming w-shape ridge anterior to beginning of 
nuchal crest; single, large, conical, postorbital scale posterior to 
last supraciliaries; keeled, sub-pyramidal scales on occipital 
region 2 or 3, with one distinctively tall, enlarged. Ventral head 
scales keeled, more greatly keeled posteriorly, heterogeneous in 
size except for a few irregularly scattered large scales, particularly 
on lateral surfaces; gular pouch present, distinct in life; 
longitudinal gular fold present in life; transverse gular fold absent. 


Dorsal body scales strongly keeled, heterogeneous in size 
and shape; scales of axillary region smaller, circular shaped; 
shoulder fold present, short, weakly defined; enlarged scales 
scattered on dorsal body, moderately raised, each bearing a 
single distinct keel; some enlarged scales arranged in 
paravertebral row close to vertebral crest; mid-dorsal crest 
scales 35-41, larger than neighboring scales; nuchal crest 
scales 5-8, greatly enlarged, triangular shaped; dorsal crests 
relatively lower, serrated. Ventral body scales keeled, largely 
homogeneous in size; enlarged scales scattered ventrolaterally. 
Limb scales keeled, largely homogeneous in size on fore-limbs, 
more heterogeneous in size on hind limbs; enlarged, keeled 
scales scattered irregularly on posterior hind surfaces and crus. 
Tail slender, scales strongly keeled. 


Coloration in life (Figure 2) 

Color codes follow Kóhler (2012). Both males and females of J. 
chapaensis are sexually dichromatic. For males, the background 
color of head varies from chamois (code 84) to olive sulphur 
yellow (code 90). Four dark brownish olive (code 127) to dusky 
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brown (code 285) transverse bands span the dorsal surface of 
the head, two of which are located on the snout, one between 
the eyes and one posterior to the eyes. The background 
coloration of the lateral surfaces of the head are light buff (code 
2) to light flesh color (code 250). Nine dusky brown (code 285) 
stripes radiate out around each eye. The two posterior-directing 
radial stripes are the broadest, with the superior one extending 
from the posterior corner of the eye to the anterior tympanic 
region, and the inferior one extending to the corner of the 
mouth. The subocular radial stripes do not extend outside of the 
circular orbit. Three to four dusky brown (code 285) blotches 
are scattered on both the supralabials and infralabials. The oral 
cavity and tongue are dark spectrum yellow (code 79). 

The background color of the neck and axillary region is light 
flesh color (code 250), which transitions to olive sulphur yellow 
(code 90), yellow green (code 103), and eventually citrine (code 
119) posteriorly. Most regions of the dorsal and lateral surfaces 
of the body possess dusky brown (code 285) reticulated 
patterns around the axillary region and blotches along the 
dorsal midline. An irregularly shaped dorsolateral stripe is 
present on each side of the body, consisting of three Pratt's 
rufous (code 72) colored, elongated blotches, running 
posterolaterally away from the vertebral midline. The coloration 
of these stripes can change to chamois (code 84) depending on 
the condition or mood of the lizards. Five small, sulphur yellow 
(code 80) blotches are evenly scattered along the dorsal 
midline from the shoulder to the pelvis. The dorsal surfaces of 
the limbs are olive sulphur yellow (code 90), with numerous 
dusky brown (code 285) bands spanning the upper surfaces of 
the limbs to the digits. A pale buff (code 1) stripe is present on 
the posterior surfaces of the upper hind limbs. The tail is pale 
buff (code 1), with six citrine (code 119) bands spanning about 
one-third of its total length. The bands of the tail extend to the 
ventral surface, forming complete rings around the tail. The 
remaining posterior surfaces of the tail are uniform ground 
cinnamon (code 270) in color. 

The ventral surface of the head is pale buff (code 1), with 
some dusky brown (code 285) speckles scattered randomly. A 
distinct dark spectrum yellow (code 78) gular spot is present in 
the center of the gular region. The background color of the 
ventral surface of the body and limbs is pale horn (code 11) 
anteriorly, with the coloration changing to light pale pinkish buff 
(code 3) posteriorly, including the upper ventral surfaces of the 
limbs and the ventral surfaces of the tail. Grayish horn (code 
268) speckles are scattered across the ventral surfaces of the 
body and limbs. 

Female body coloration differs from male coloration 
(described above) as follows. The dorsal surfaces of the head 
and neck are mostly uniform clay (code 18) in coloration. The 
lateral surfaces of the head are yellow-green (code 103), with 
two to four clay (code 18) colored radial stripes extending from 
around the eyes. Five rectangular clay (code 103) blotches are 
scattered evenly along the dorsal surface of the body at the 
midbody, with all blotches loosely connected along the dorsal 
midline. The background color of the lateral body surfaces, 
dorsal limb surfaces, and dorsal tail surface are pistachio (code 
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102). A pale buff (code 1) stripe is present posteriorly on each 
upper hind limb. Clay (code 18) bands are present on the first 
one-third of the tail; however, the bands extend to the lateral 
surfaces of the tail only, and do not form complete rings. The 
posterior two-thirds of the tail is uniform clay (code 18). 

The ventral surface of the head is uniform light pistachio 
(code 101) with no speckles. This coloration transitions 
gradually to dark spectrum yellow (code 79) toward the center 
of the gular region, forming a gular spot. The ventral surfaces of 
the body, limbs, and tail are pale green (code 99). 


Ecology and distribution 

Japalura chapaensis is arboreal, inhabiting low shrubs in 
tropical forest, particularly near streams and along forest edges 
(Figure 3). In Lvchun, individuals are usually found resting on 
fern leaves at night. Currently, the species is known from 
Lvchun County of Honghe Prefecture, Yunnan Province, China, 
and from Cao Bang, Hai Duong, and Lao Cai Provinces of 
Vietnam (Ananjeva et al., 2007). In addition, morphologically 
similar individuals have been photographed in Daweishan 
Natural Reserve of Honghe Prefecture, 130 km east from 
Lvchun County, which suggests that the species has a wider 
geographic distribution along the China-Vietnam border (Jian 
Wang and Shuo Qi, personal communications). 


Diagnoses from congeners 

Previous studies have compared J. chapaensis with five island- 
congeners (J. brevipes, J. makii, J. mitsukurii [synonym of J. 
swinhonis], J. polygonata xanthostoma, and J. swinhonis), and 
two mainland-congeners (J. splendida and J. yunnanensis) 
(Ota, 1989; Ota & Weidenhófer, 1992). Based on their results, 
previous authors concluded that J. chapaensis is morphologically 
most similar to J. brevipes, J mitsukurii, J. swinhonis, and J. 
yunnanensis, but can be distinguished from these lineages by 
its relative head length, relative snout-eye length, Toe IV length, 
and Toe IV subdigital lamellae and middorsal crest scale counts 
(Ota, 1989; Ota & Weidenhófer, 1992). 

With the addition of the new collections from southern 
Yunnan and a more comprehensive examination of congeners, 
we found that J. chapaensis can be readily differentiated from 
all five Taiwanese species by having much larger nuchal crest 
scales and a distinct head shape with a w-shaped lump on the 
occipital region anterior to the nuchal crest. In addition, we 
found J. chapaensis to be morphologically most similar to J. 
yunnanensis, with both species lacking a transverse gular fold, 
and possessing concealed tympanum, large nuchal crests, 
irregularly distributed and enlarged ventrolateral scales, dark 
spectrum yellow gular spots in males (sometimes in females), 
and jagged dorsolateral stripes that are oriented in a 
posterolateral direction away from the dorsal midline. However, 
although phenotypically similar, Japalura chapaensis differs 
from J. yunnanensis by having a tendency toward fewer mid- 
dorsal crest scales (MD 35—41 vs. 39-46), and relatively shorter 
tails (TAL/SVL 249.63% in male, 204.70%—228.61% in females 
vs. 255.19%—288.81% in males, 237.45%-—259.67% in females) 
(Table 2). 





Figure 3 Habitat of Japalura chapaensis in Lvchun, southeastern Yunnan Province, China (Photograph by Yu-Fan Wang) 


For the remaining mainland congeners from non-Himalayan 
regions, J. chapaensis differs from J. fasciata by having a 
longer tail (TAL >204% SVL vs. <180%) and longer hind limbs 
(HLL >74% SVL vs. <69%), and by the absence of a transverse 
hourglass-shaped marking on the dorsum (vs. present); from J. 
batangensis, J. brevicauda, J. dymondi, J. flaviceps, J. grahami, 
J. iadina, J. laeviventris, J. splendida, J. vela, J. yulongensis, 
and J. zhaoermii by having significantly enlarged nuchal crest 
scales (vs. not significantly differentiated from dorsal crest 
scales), bright yellow oral cavity (vs. flash color or blackish blue), 
and by the absence of a transverse gular fold (vs. present); 
from J. varcoae by having a concealed tympanum (vs. exposed) 
and enlarged, differentiated nuchal crest scales (vs. not 
significantly differentiated); and from J. micangshanensis by 
having significantly enlarged nuchal crest scales (vs. not 
significantly differentiated from dorsal crest scales) and by the 
presence of gular spots (vs. absent). 

In conclusion, Japalura chapaensis can be diagnosed from its 
congeners on the basis of the following set of characters: (1) 
body size moderate SVL 58.10—68.50 mm; (2) tail relatively 
long TAL/SVL 249.63% (male), 204.7096—228.6196 (females); 
(3) limb length moderate, FLL/SVL 48.14%-54.73%, HLL/SVL 
74.66%-81.41%; (4) short, w-shaped ridge present on occipital 
head anterior to nuchal crest; (5) tympanum concealed; (6) 
supraciliaries overlapping less than one-third of posterior one; 
(7) NSL absent (8) lateral gular fold and gular pouch present, 
well developed; (9) transverse gular fold absent; (10) nuchal 


crest scales large, crest height/HL 10.48%-14.04%; (11) MD 
35-41; (12) T4S 28-30; (13) ventral scales heterogeneous in 
size with enlarged scales scattered ventrolaterally; (14) ground 
body coloration camouflaged with yellow-green, clay, and dusky 
brown coloration in life; (15) dorsolateral stripes present in 
males, jagged, sometimes discontinuous, posterolaterally away 
from the dorsal midline; (16) gular spot present, distinct in 
males, somewhat faded in females, dark spectrum yellow in 
both sexes; and (17) oral cavity and tongue dark spectrum 
yellow in life. 

Despite our discovery of a new population of J. chapaensis in 
southern China, the small number of individuals seen in the wild 
continues to limit our understanding of this rare Japalura 
species in northern Indochina. Future surveys and greater 
international collaborations throughout this region are needed to 
clarify the morphological variation, distribution patterns, and 
population-level genetic diversity of J. chapaensis. 
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APPENDIX 

Specimens examined 

Japalura batangensis (n=16): CIB 1902-1908, 2227, 2233, 2243, KIZ 84011, 
801081, Batang, Sichuan, China; KIZ 09404, 019311, 019312, KIZ 019314, 
Mangkang, Tibet, China. 

Japalura dymondi (n-7): CIB 1869, 87234, Panzhihua, Sichuan, China; KIZ 
9511001, 1002, 1016, 1018, 1022, Dayao, Yunnan, China. 

Japalura fasciata (n=3): CIB 2620, 2621, 2622, Pen Hsien, Sichuan 
Province, China. 

Japalura flaviceps (n-13): CIB 2234, 2332, 2333, 2341, 2354, 2355, 2549, 
2554, 2556, 2561, 2567; KIZ 05181, 05182; Luding, Sichuan, China. 
Japalura grahami (n-1): USMN 65500 (holotype), Yibin, Sichuan, China. 
Japalura makii (n-2): MCZ R-172743 (paratype), R-181443, Taiwan, China. 
Japalura micangshanensis (n-9): CIB 86348, 86351, Xianyang, Shaanxi, 
China; CIB 86356, 86357, 86360, 86361, Luonan, Shaanxi, China; CIB 2572, 
2578, 2582, Wenxian, Gansu, China. 

Japalura polygonata (n-3) CAS 21215, 21243, 21244, Kagoshima 
Prefecture, Japan. 


Japalura splendida (n=6): USNM 35522 (holotype), Yichang, Hubei, China; 
CIB 2588, 2591, 2596, 72468, 72469, Chongqing, China. 


Japalura swinhonis (n=4): CAS 18085, 18089, 18098, 18099, Taiwan, China. 


Japalura varcoae (n=3): CIB 2650, 2651, KIZ 85110006, Kunming, Yunnan, 
China. 

Japalura vela (n=11): KIZ 013801 (holotype), KIZ 013802, 013813, 013800, 
013805-013811 (paratopotypes), Jerkalo, Tibet, China. 


Japalura yunnanensis (n-8): CIB 2684, 2686, 2687, 2689, KIZ 82081, 
Longling, Yunnan, China; KIZ 74110240, 0248, 791469, Tengchong, Yunnan, 
China. 

Japalura zhaoermii (n-14): CIB 2690 (holotype), 86432, 86435, 85721, 
85722, 86433, 86434, 86436, Wenchuan, Sichuan, China; CIB 2232, 2244, 
2240, KIZ 84032, 85030, Lixian, Sichuan, China. 
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ABSTRACT 


Echolocating bats have developed advanced 
auditory perception systems, predominantly using 


acoustic signaling to communicate with each other. 


They can emit a diverse range of social calls in 
complex behavioral contexts. This study examined the 
vocal repertoire of five pregnant big-footed myotis bats 
(Myotis macrodactylus). In the process of clustering, 
the last individual to return to the colony (LI) emitted 
social calls that correlated with behavior, as recorded 
on a PC-based digital recorder. These last individuals 
could emit 10 simple monosyllabic and 27 complex 
multisyllabic types of calls, constituting four types of 
syllables. The social calls were composed of highly 
stereotyped syllables, hierarchically organized by 
a common set of syllables. However, intra-specific 
variation was also found in the number of syllables, 
syllable order and patterns of syllable repetition across 
call renditions. Data were obtained to characterize the 
significant individual differences that existed in the 
maximum frequency and duration of calls. Time taken 
to return to the roost was negatively associated with 
the diversity of social calls. Our findings indicate that 
variability in social calls may be an effective strategy 
taken by individuals during reintegration into clusters 
of female M. macrodactylus. 


Keywords: Social calls; Clustering; Big-footed myotis 
INTRODUCTION 


Animals have evolved a variety of communication systems 
across all sensory modalities, such as visual, chemical, 
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electrical, and acoustic signals. Of the communication 
modalities, acoustic signals have the advantage of 
broadcasting information for longer distances with less 
interference from physical barriers. This advantage was likely 
a driving force in the evolution of acoustic communication 
in a wide range of taxa (Seyfarth et al., 2010; Wilkins et al., 
2012). Acoustic communication fulfills an important function 
in social interactions, especially for bats, who are among 
the most diverse and gregarious of all mammals and live in 
predominately dark environments (Kerth, 2008). The acoustic 
communication systems observed in echolocating bats are 
extremely complex in the mammalian world because they can 
emit two kinds of calls: echolocation and social calls, which 
serve different functions (Kanwal, 2009, 2012). Social calls are 
mainly used to transmit information under special behavioral 
contexts. Many bat species demonstrate a diverse repertoire of 
social calls under a variety of behavioral conditions (Bohn et al., 
2008; Clement et al., 2006; Gadziola et al., 2012; Knórnschild 
et al, 2010; Ma et al, 2006). These characteristics of 
bats provide a unique opportunity for research on acoustic 
communication of animal sociality. 

The motivation-structure hypothesis states that social 
call types are decided by the behavioral context of birds 
and mammals (Kanwal, 2009; Morton, 1977). In bats, the 
behavioral settings of their vocalizations include aggressive 
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or warning, courtship, communication, distress, mother-infant 
recognition, and clustering behaviors. Intra-specific 
communication under specific behavioral contexts usually 
carries significant information about the signaler (Behr et 
al., 2009; Behr & von Helversen, 2004; Knórnschild and 
von Helversen, 2008). Under some behaviors, intra-specific 
differences and elaborate sequences of acoustic signals are 
of importance, and contribute to accurate mother-infant and 
individual recognition within a colony. For instance, songs of 
free-tail bats contain multiple elements (e.g., syllables, notes 
and/or phrases) with highly structured, rather than random, 
arrangement. Males often modify types of calls through 
modulating the repeated syllables, which can differ among 
individuals (Bohn et al., 2009). Clustering behavior is another 
important context in which bats emit social calls. Mammals 
living in low-temperature ambient environments often huddle 
together to reduce energy consumption and promote member 
fitness (Gilbert et al., 2010). The central temperature of a 
cluster is higher, and the central site is safer than the border 
position (Hatchwell et al., 2009). Thus, achieving the central 


position is often the objective of an individual within a cluster. 


Acoustic signals containing characteristic information may 
contribute to accurate recognition, which can help individuals 
save energy and time during contention for the central cluster 
site (Dugatkin and Earley, 2004). Although several studies 
have examined the clustering behavior of bat species (Kerth, 
2008), there have been no studies on the diversity and 
structural characteristics of acoustic communication that occur 
during the clustering of gregarious bats. 

Big-footed myotis, Myotis macrodactylus, belongs to the 
subgenus Leuconoé of the genus Myotis (Yangochiroptera, 
Vespertilionidae). Sexual segregation in colonies is known 
to occur in many bat species, with separate clustering of 
females during the summer pupping season (Bradbury, 
1977; McCracken and Wilkinson, 2000). Females rear their 
young communally with the formation of maternity colonies, 
whereas males are solitary or form small bachelor groups 
(Senior et al., 2005). The clustering of pregnant bats is a 
cooperative behavior that permits individuals involved in social 
thermoregulation to minimize heat loss and thereby lower 
their energy expenditure, and may allow them to reallocate 
the saved energy to other functions such as growth and 
reproduction (Gilbert et al., 2010). Thus, the cluster center is 
the optimal position for pregnant bats, and may assist better 
fetal development. According to our observations, the last 
individuals (LI) returning to a group will usually try to enter 
the central position of the cluster. Approaching behaviors 
are always accompanied by social calls produced by the LI 
until physical contact with the other bats. Thus, we assumed 
that the social calls emitted by the LI attempting to enter the 
cluster center are related to this special behavior. We therefore 
examined two questions: (1) Are these social calls associated 
with successful entrance into the center of the cluster? (2) 
Does the variety or structure of the social calls influence this 
behavior? We recorded and analyzed the social calls emitted 
by M. macrodactylus under this behavioral context. 


MATERIALS AND METHODS 


Acquisition and maintenance of animals 

The study was conducted from May to June 2010. Five 
pregnant M. macrodactylus individuals were randomly chosen 
from 90 pregnant bats in a natural cave in Jilin Province, China. 
They were banded and roosted together as a colony in a 
temperature- and humidity-controlled animal room (20 mx10 
mx4 m) at Northeast Normal University in Changchun, China. 
The day-night cycle was controlled at 12 h dark and 12 h light. 
They could fly freely within the animal facility and were given 
water and mealworms ad libitum. Before the experiment, all five 
subjects were habituated to handling and to the testing cage. 


Acoustic recordings 

We recorded and analyzed the social calls and associated 
behaviors of the captive bats. To optimize recording quality, the 
five animals were housed in a small testing cage (200 cmx80 
cmx100 cm) in a room lined with anechoic foam, separate from 
the animal holding room. To reduce environmental interference 
with the ultrasonic calls, environmental temperature and 
humidity of the room were kept constant. 

Vocalizations were recorded using two ultrasonic condenser 
microphones (CM16, Avisoft Bioacoustics, Berlin, Germany) 
connected to amplifiers and A/D converters (UltraSoundGate 
416 H). The microphone had a flat frequency response 
from 10-250 kHz. The gain of each microphone was 
independently adjusted to optimize the signal-to-noise ratio 
of the recording. Acoustic signals were digitized at 375 kHz 
with 16-bit depth, and monitored in real time with RECORDER 
software. We used two infrared surveillance cameras (Pensee, 
PIS-322EG) to simultaneously record the behavior of all 
animals. We defined the last animal to join the group (LI) as 
the last individual moving toward the cluster and displaying 
"approaching" behavior (Kobayasi et al., 2012). In most cases, 
the LI was observed as roosting last within a cluster. Most 
recordings occurred during the clustering process after bats 
had eaten (Figure 1). The characteristics of their social calls 
and the time they took to join the cluster were analyzed 
using Avisoft-SasLab Pro software, Version 5.1.20 (Avisoft 
Bioacoustics, Berlin, Germany). Time here was defined as 
consumed time (CT). The study was conducted for 38 d from 
2100 h—0100 h each night. 


Analysis of vocalizations 

We examined the social call sequences of the Ll using 
nomenclature similar to Kanwal et al. (1994) and Bohn et al. 
(2008), with a syllable considered the smallest acoustic unit of 
a vocalization defined as one continuous emission surrounded 
by background noise, and a call considered the simplest 
emission pattern of a vocalization used for communication 
consisting of one or more elements (syllables). Vocalizations 
were semi-automatically analyzed using Avisoft-SasLab Pro 
software (Version 5.1.20 Avisoft Bioacoustics). Syllable start 
and end times were automatically detected using a standard 
threshold level (596) and hold time (2 ms), and manually 
adjusted when necessary. Signals were high-pass filtered 
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Figure 1 Schematic showing the cage set-up for simultaneous audio and video recordings of captive M. macrodactylus 


A: Testing was conducted in a wire mesh cage (200 cmx80 cmx100 cm). Two ultrasonic microphones (UltraSoundGate CM16) were placed outside the cage 


and pointed at the center of the ceiling of the cage, which was about 50 cm above the tip of the microphone. Two video monitors were used to monitor and record 


behaviors. B: Cluster of bats (illustrated as solid circles) and approaching bat (dotted circle), both in a bottom-up view. The sketch, taken from a sequence of 


video frames, shows the LI approaching the colony after foraging. 


at 1 kHz and the amplitude of each pulse was normalized 
to 0.75 V (Bohn et al., 2008; Gadziola et al., 2012; Kanwal 
et al., 1994). As a result, social call sequences were of 
similar amplitude and unwanted echolocation signals were 
removed. For analyses, we randomly selected three calls per 
call type from all individuals that were of sufficient quality for 
measurements and syllable identification. The social calls were 
automatically analyzed with the Avisoft-SasLab Pro software 
(Version 5.1.20 Avisoft Bioacoustics). Spectral and temporal 
analyses were obtained from the spectrogram and oscillogram 
with a Fast Fourier Transform (FFT) length of 512 points 
(87.5% overlap), which allowed for a frequency resolution of 
732 Hz and temporal resolution of 0.171 ms. We measured 
the following parameters for each social call (Figure 2): total 
duration of call (totdur), minimum (fmin) and maximum (fmax) 
frequencies of call, and mean frequency of highest energy 
(freq). This was done by measuring the peak frequency of 
each component (and then calculating average); number of 
components to call (nocomp); and number of different syllables 
(Difptype) in each call (Russ et al., 2004; Russo & Jones, 
1999). If more than one harmonic was present, values of the 
first (fundamental) harmonic were taken. Additionally, we used 


oscillograms and spectrograms to identify syllables within calls. 


We also examined the composition of each call in terms of the 
total number of each syllable type and the proportion of calls 
with syllables. Finally, for each call recorded, we assigned a 
call variant and call type based on its sequence of syllables. 
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Statistical analyses 

Call parameters were checked for normality using the 
Kolmogorov-Smirnov test. Since all call parameters were 
normally distributed, differences in call characteristics between 
individuals were compared using one-way ANOVA, followed 
by Tukey multiple-comparison tests. Discriminant function 
analysis (DFA) was used to determine to which individual 
certain social calls should belong. We examined the general 
relationships between the variables across all the Lls, including 
call variants and CTs, by performing Spearman correlations. 
We used a simple linear regression analysis to assess the 
statistical significance of call variation as a predictor of 
individual recognition of CT in the process of approaching 
clusters. All statistical analyses were performed using SPSS 
15.0 statistical software (SPSS Inc., Chicago, IL, USA). The 
alpha level was set to 0.05 and data were presented as 
means+SD. 


RESULTS 


Acoustic structure of calls 

A total of 885 call sequences were analyzed from five 
individuals when the LI approached the cluster. We 
distinguished a total of four syllable types, which could be 
combined to form 10 simple monosyllabic calls and 27 complex 
multisyllabic calls. The four syllable types and their 37 
combinations are outlined in Figure 3 and Figure 4: downward 
frequency-modulated syllable (DFM; Figure 3A), short quasi-V 
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totdur 


Time (s) 


Figure 2 Spectrograms (center) of social calls of Myotis macrodactylus showing the change of frequency with time 


Oscillogram (top) gives the change of amplitude with time. Mean frequencies (left part of spectrum) are the average amplitude of different frequencies of the 


measured time window. Parameters evaluated in this study are shown. 
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0.02 0.04 


Figure 3 Echolocation pulse (FM) and simple syllables 
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Oscillogram (top) of the syllable(s) with the corresponding spectrogram (bottom). Syllables are: A, downward frequency-modulated syllable (DFM); B, short 


quasi-V frequency-modulated syllable (VFM); C, arched frequency-modulated syllable (AFM); D, sinusoidal frequency-modulated syllable (SFM). 


frequency-modulated syllable (VFM; Figure 3B), arched 
frequency-modulated syllable (AFM; Figure 3C) and sinusoidal 
frequency-modulated syllable (SFM; Figure 3D). In the syllable 
sequence (Figure 5), social calls started with DFM syllables, 
which, in turn, usually gave way to VFM syllables. The VFM 
syllables were followed by SFM syllables, usually at the end of 
a call. The AFM syllables were only interjacent in the sequence 
and only made transition with DFM syllables. Typically, SFM 
syllables had a longer duration (10.4+2.8 ms) than that of the 
other syllable types (DFM, 1.6+0.7 ms; VFM, 4.2+0.8 ms; AFM, 
3.5+0.8 ms). 


Individual acoustical differences 
To establish whether the social calls were individual-specific, 


we performed a DFA using four spectrotemporal and two 
syllable parameters (Table 1). This analysis resulted in two 
canonical functions with eigenvalues greater than 1, which 
together explained 90.1% of the variation in the calls from the 
five bats (Figure 6). Subsequent univariate ANOVA showed 
that the first and second canonical functions could be used 
to differentiate between social calls of individual bats (PC1: 
F4, 67=36.6, P«0.001; PC2: F4,67213.4, P«0.001). Maximum 
frequency and duration were the most distinctive features of 
these calls, indicating different acoustic repertoires. These 
analyses suggested that social calls from different individuals 
should be distinguishable by conspecifics. 
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Figure 4 Spectrograms and oscillograms of multi-element social calls from female Myotis macrodactylus 


Figure 5 Model of calls based on transition syllables 
Arrows represent transitions from one syllable to the next. Arrow thickness increases with transition frequencies that deviate from the expected by more than 
10%, 30%, or 50%. Dotted lines represent the end of a call. See Figure 4 for examples of each call. 
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Table 1 Social call parameters from Myotis macrodactylus females 





























Bat Call (n) Variants totdur (ms) freq (kHz) fmax (kHz) fmin (kHz) Pulses (per 0.1s) Difptype 
1 33 (3) 4 46.6+10.5 b 49.4+5.2a 100.4+6.7 a 23.6+3.1 b 5.5+1.2a 1.040.0a 
2 51 (4) 7 37.8+7.9 a 49.6+9.3 a 99.1+8.5a 25.5+3.6 a 4.9+0.8 a 1.40.6 b 
3 120 (5) 12 51.1+15.3 bc 39.8+4.3 c 78.4+8.1 d 18.0+3.1 e 7.4+2.3 © 1.10.2 a 
4 297 (11) 20 52.0+14.1 c 46.2+5.2 b 93.1+9.6 b 20.6+2.9 c 7.2+2.1 6 1.2+0.4 a 
5 384 (15) 29 49.7+10.6 bc 45.8+4.6 b 87.0+7.5 c 19.2+2.8 d 6.51.5 b 1.40.7 b 
F 14.94* 50.59* 105.43* 79.39* 26.37* 18.37* 





“n” is the total number of times the bat was LI, restricting analysis to three calls per call type. Same letter (a, b, c, d, e) indicates not significantly 


different from each other (Tukey multiple-comparison test). *: P<0.001. 





Canonical function 2 
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Canonical function 1 


Figure 6 Distribution of social calls from five females (marked by different symbols) on first two canonical axes 


Function 1 was most correlated with maximum frequency and function 2 was most correlated with duration. Large circles enclose areas with 75% probability of 


including calls from each bat. 


Call variants and cluster success 

The clustering success of females was measured as the 
time taken to join a cluster (CT). We correlated the CT 
with the number of call variants emitted during one complete 
process of returning to the colony. Because call variants were 
most influenced by type and number of syllables, a negative 
correlation between the call variants of the LI and cluster 
success was derived (Figure 7). Clustering success tended 
to increase with the total number of call variants produced. The 
number of social calls for the five individuals varied from 4 to 29 
during the entire research session. We found that call diversity 


was positively correlated with clustering efficiency. 
DISCUSSION 


Intra-species variability of LI calls was found in the number 
of syllables, syllable order and syllable repetition across 
call renditions. These data confirmed that differences in 
social calls existed between M. macrodactylus individuals 
during the process of approaching the colony. Our results 
further confirmed the hypothesis that social call richness 
was significantly negatively correlated with time taken for the 
successful reunion of M. macrodactylus females in a cluster. 
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Figure 7 Relationship between call variants and time taken to join a cluster in Myotis macrodactylus 


Solid line depicts linear regression. 


The LI can emit 10 simple monosyllabic and 27 complex 
multisyllabic call types composed of four syllable types. 
In previous studies, simple monosyllabic calls have been 
recorded in Myotis species at maternity roosts before bats 
leave and after they arrive (Pfalzer & Kusch, 2003), and 
usually begin with downward frequency-modulated signals. 
In mammals, repeated calls in communication signals can 
indicate a general coding rule (Fischer et al., 2001; Schehka et 
al., 2007). However, complex multisyllabic calls in Pipistrellus 
pipistrellus constitute different syllables, which may attract 
conspecifics and thereby help to deter predators (Budenz et 
al., 2009). We found that complex multisyllabic calls were 
emitted by M. macrodactylus females returning to the colony; 
furthermore, the diversity of calls was affected by syllable type 
and unique syllable types led to specific calls. New evidence 
has suggested that Megaderma lyra can regularly modify the 
structure of their calls, including the number and repetition of 
syllables, according to competing strength (Bastian & Schmidt, 
2008), and signalers may convey their own properties in 
acoustic signals (JanBen and Schmidt, 2009). 

The sequences of pulses in M. macrodactylus female calls 
demonstrated a stable pattern. The social calls always 
began with a downward frequency-modulated component and 
were followed with one or two other sub-syllables; by adding 
syllables or types of syllables based on this rule, call variants 
were increased. Similar descriptions of calls with high diversity 
have been found in studies on Saccopteryx bilineata (Behr & 
von Helversen, 2004), Tadarida brasiliensis (Bohn et al., 2009) 
and Pipistrellus nathusii (Russ and Racey, 2007; Jahelková 
et al., 2008) courtship songs. These studies noted large 
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inter-individual differences in calls, with males able to emit 
diverse courtship calls, and differences in individual calls 
most evident in regard to maximum frequency and duration. 
Higher diversity in the community may contribute to species 
identification in bats. In our study, individuals emitting low 
diversity calls required more time to join the group, and 
occasionally could not attain the central position. The border 
temperature of a group is lower than that of the central position, 
So bats need more energy consumption for thermogenesis. 
Therefore, the LI must repeatedly communicate with others 
when returning to the colony to obtain maximum benefit. To 
reduce the time taken for the recognition process, social 
behavior may drive the diversity of individual calls. 

We also found that the same calls were emitted by different 
individuals; these calls may function as identification signals 
among members of a colony and have a universal meaning. 
However, variation in the number and type of syllables may be 
due to specific information conveyed by individuals. Huddling 
together in a roost is likely a thermoregulatory strategy used by 
M. macrodactylus to conserve energy and increase survival or 
fitness. In this study, five individuals were randomly selected 
from 90 females, with no significant differences in forearm 
length or body size impacting the results. Each of the five 
bats could potentially be the LI, but the time spent alone 
by each bat was negatively correlated to their call diversity. 
Under natural circumstances, calls may be relevant to status 
identification. The clustering of numerous individuals would 
make diverse calls more identifiable by others. In this study, 
individuals that emitted multiple social calls may be more 
easily identified and accepted by others, such that individual 


calls may evolve towards complexity under natural selection. 
This article suggests a role for the sophisticated acoustic 
communication ability developed by bats. Further research is 
needed to elucidate the process of identification through social 
calls, which can be conducted through multiple methods and is 
affected by behavioral context. 

Our results indicated that the social calls emitted by 
clustering M. macrodactylus females always exhibited a stable 
structure and diversity. Furthermore, differences in social call 
variability between M. macrodactylus females was found, and 
individuals emitting more complex social calls took less time 
to rejoin the cluster. However, further work is needed to 
investigate the specific function of this diversity in social calls. 
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ABSTRACT 


With the wide application of DNA sequencing 
technology, DNA sequences are still increasingly 


generated through the Sanger sequencing platform. 


SeqMan (in the LaserGene package) is an excellent 
program with an easy-to-use graphical user interface 
(GUI) employed to assemble Sanger sequences into 
contigs. However, with increasing data size, larger 
sample sets and more sequenced loci make contig 
assemble complicated due to the considerable number 
of manual operations required to run SeqMan. Here, 
we present the ‘autoSeqMan’ software program, which 
can automatedly assemble contigs using SeqMan 
scripting language. There are two main modules 


available, namely, ‘Classification’ and ‘Assembly’. 


Classification first undertakes preprocessing work, 
whereas Assembly generates a SeqMan script to 
consecutively assemble contigs for the classified 
files. Through comparison with manual operation, we 
showed that autoSeqMan saved substantial time in the 


preprocessing and assembly of Sanger sequences. 


We hope this tool will be useful for those with large 
sample sets to analyze, but with little programming 
experience. It is freely available at https://github.com/ 
Sun- Yanbo/autoSeqMan. 


Keywords: Batch processing; Sanger sequences; 
Contig assembly; SeqMan 


INTRODUCTION 


DNA sequencing technology has experienced a revolutionary 
shift from automated Sanger sequencing (Sanger et al., 1977) 
to next-generation sequencing (NGS; reviewed by Shendure 
& Ji (2008) and Shendure et al. (2004). Although NGS 
has dominated due to its high throughput (Schuster, 2008), 


Science Press 


it is not suitable for many population studies due to high 
costs and other limiting factors. For example, errors are 
always introduced in final assembly and/or annotation results 
using NGS data (Bickhart et al., 2017), and thus variations 
detected in high-throughput analyses require validation by 
Sanger sequencing (Wall et al., 2014). Furthermore, for 
some present population genomic studies, error rates have 
been found to increase with increasing depth of coverage for 
Illumina data, and thus caution is needed when interpreting the 
results of next-generation sequencing-based association studies 
(Wall et al., 2014). As such, Sanger sequencing technology is 
still widely used in many research fields, including evolutionary 
taxonomy based on short DNA sequences (Chen et al., 2017), 
evolutionary history study of wild animals (Yuan et al., 2016), 
biodiversity estimates and influencing factors (Zhou et al., 2017), 
and validation of mutations identified from high-throughput 
analyses (Sun et al., 2013). 

Further, with the wide application of DNA sequencing 
technology, e.g. DNA barcoding, which uses short and 
standardized DNA sequences for individual identification of 
organisms (Hajibabaei et al., 2007; Savolainen et al., 2005), 
Sanger sequencing data are continuing to be accumulated 
among evolutionary taxonomists and others. Thus, batch 
manipulation of these Sanger sequences has become an 
important task before downstream analyses, especially for 
those who doesn't have programming or bioinformatics 
background or experiments. Although several sequence 
manipulation packages for general purpose issues have been 
published previously, including MEGA (Kumar et al., 2016), 
EMBOSS (Rice et al., 2000), and FasParser (Sun, 2017), these 
packages are all based on assembled contigs (a consensus 
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Figure 1 Overview of assembling tasks for Sanger sequences 
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Common tasks for assembling Sanger sequences include (1) classifying sequence files into corresponding folders (always completed manually) and (2) opening 


SeqMan to add the ab1 files that belong to a same sample and then assembling them (needing considerable mouse operations). The autoSeqMan program was 


designed based on these steps, with the 'Classification' and 'Assembly' functions corresponding to step 2 and 3, respectively. 


region of overlapping DNA segments) and no key consideration 
has been taken on the batch assembly of Sanger sequences. 

SeqMan is a popular program in the LaserGene software 
package (DNAStar, Inc., Madison, WI, USA), which is used 
for assembling Sanger sequences into contigs and has been 
widely applied in a great number of studies. It can handle two 
to thousands of Sanger sequences at one time but requires a 
considerable number of manual operations (e.g., mouse actions, 
Figure 1) to run. Hence, it is complicated and time-expensive 
for those with large sets of samples to assemble. Fortunately, 
since the release of Version 7, SeqMan now provides a scripting 
language, including commands for opening, naming, saving, and 
closing projects, and a single script may be used to execute 
multiple assemblies consecutively without manual intervention. 

Here, we developed a program called autoSeqMan, which 
provides a simple way to automatedly classify Sanger 
sequences and then consecutively assemble them on a 
personal computer. It is mainly designed for researchers with 
large sets of samples with one or more loci sequenced. 


IMPLEMENTATION AND REQUIREMENTS 


autoSeqMan was developed into a standalone Windows 


desktop application (compiled and tested in Windows 7/10). 


It involves two modules, ‘Classification’ and ‘Assembly’, 
corresponding to steps 2 and 3 in Figure 1, respectively. Each 
module can handle multiple files and needs the user to select 
the directory either containing the raw Sanger sequence files 
(*.ab1 files) or containing the classified sub-folders created by 
‘Classification’. Theoretically, there is no limit to the number of 
files that can be analyzed. 

This tool requires the sequence files be named in a 
specialized format, in which the sample ID should be present 
at the beginning of the file name. The Classification module will 
recognize the sample ID by the appropriate delimiter and then 
create sub-folders (see below). For convenience, autoSeqMan 
also provides a "Rename" tool to help users rename the ab1 
files for the below analyses. 
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CLASSIFICATION 


This function is designed to automatedly create sub-folders 
according to the sample ID and/or sequenced locus. All 
downstream analyses are performed in the corresponding 
sub-folders, where all analyzed results are also saved. 
According to our laboratory experience, this is an efficient 
and convenient way to manage and query laboratory samples 
(Chen et al., 2017; Zhou et al., 2017). 

The only input is the directory name, which contains the 
raw ab1 files. There are several input prerequisites required 
for Classification performance. First, all files must be stored 
in a same directory. Second, all files must be named 
according to a certain pattern, i.e., "sample-locus-others". 
For example, the file name "YPX24212 168-2215 TSS 
20171122-0871-1171_H02.ab1” denotes that it is a DNA 
sequence of 16S and the sample number is “YPX24212”. The 
program will automatedly recognize the filename according 
to the user-specified delimiter and then create a sub-folder 
"YPX24212 16S" in the main output folder. The delimiter can 
be “-“, " ", or other. After classification, the program will list 
all sub-folders, and the user can look at the files classified into 
each sub-folder by simply clicking the folder name (Figure 2). 


ASSEMBLY 


This function will automatedly assemble the classified 
sequence files. It will first read the list of classified sub-folders 
created by the 'Classification' function, and then generate a 
SeqMan script for consecutively assembling the sequences 
in each sub-folder. To perform this function, the user must 
first install the LaserGene package (version 7 or higher), and 
then tell autoSeqMan the full path of the SeqMan program 
(which can be always recognized automatedly by autoSeqMan), 
after which the program will complete all assembly tasks and 
save the assembly results automatedly. The default script will 
generate all SQD, FAS, and SEQ results (Figure 3). 
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Figure 2 Overview of ‘Classification’ function in autoseqMan 
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To perform this function, user needs to tell autoSeqMan the full path of a directory which contains all raw ab1 files. The sample name should be present at the 


beginning of the file name, which will be recognized and extracted by the autoSegMan according to the specified delimiter. 


newProject 


assemble file:” {mainFolder} \{subfolder}” 


expandDir: false 
optimizeOrder: true 
contamScan: false 
vectScan: true 
trimEnds: true 
doAssemble: true 


exportContigs contigs:all file:” {mainFolder} \ {subfolder} \ {out}. seq” 


format : DNA 
doGaps: true 
doFeature: true 


exportSequences contigs:all file:” {mainFolder} \ {subfolder} \ {out}. fas” 


format: FAS 
doGaps: true 


saveProject file:” {mainFolder} \ {subfolder} \ {out}. sad" 


Figure 3 Default SeqMan script for assembling Sanger sequences 


This script will be generated automated by autoSeqMan. There is no need for users to edit it. In default, all SQD, SEQ, and FAS outputs are generated. 


PERFORMANCE 


The main aim of autoSeqMan is to save manual operation 
in preparing files and running the SeqMan program. To 
evaluate its performance, we applied this tool to our laboratory 
data (Chen et al., 2017; Zhou et al., 2017). In this test, 
one hundred samples were used, each of which had two 
ab1 files available. Results showed that the Classification 
operation created sub-folders (named sample ID as well as 
locus name if provided) and moved the appropriate files into 
the sub-folders within 8 s, substantially less than the time used 
for manual operation (about 1 h, as tested by our colleagues). 
Performance of the Assembly operation greatly depended on 


the running efficiency of SegMan. In this test, the Assembly 
module required 64 s to consecutively assembly contigs for 
the classified sequences, also substantially less than the ~2 
h required for manual operation, suggesting the significance of 
autoSeqMan in dealing with large date sets. 


LIMITATIONS 


It is important to note that autoSeqMan does not undertake 
any filtration manipulation on the sequence data, even though 
poor-quality sequence ends are always present. Thus, after 
running autoSeqMan, users should undertake quality control 
measures of the final assembly with SeqMan. In addition, the 
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output Fasta files will have very long IDs, which might introduce 
some errors in subsequent sequence analyses. If necessary, 
users can use the "Sort & Rename" function of FasParser (Sun, 
2017) to shorten these IDs. 
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Complete mitochondrial genome of the Thai Red 
Junglefowl (Gallus gallus) and phylogenetic analysis 


DEAR EDITOR, 


In this study, we sequenced the complete mitochondrial 
genome (mitogenome) of the Thai Red Junglefowl (RUF; 
Gallus gallus) using the next-generation sequencing (NGS) 
platform of the lon Torrent PGM. Samples were taken from 
Mae Wang District, Chiang Mai Province, northern Thailand. 
Our data showed the complete mitogenome to be 16 785 
bp in length, composed by 13 protein-coding genes, 22 
tRNA genes, two rRNA genes, and one control region. The 
genome nucleotide composition was 30.3% A, 23.7% T, 32.5% 
C, and 13.5% G, resulting in a high percentage of A+T 
(50.4%). Phylogenetic analysis revealed that the mitogenome 
belonged to haplogroup X, whereas those of all domestic 
chickens belong to haplogroups A to G. This newly released 
mitogenome sequence will advance further evolutionary and 
population genetics study of the RJF and domestic chicken. 
The availability of the G. gallus mitogenome will also contribute 
to further conservation genetics research of a unique species, 
listed as 'data deficient' in Thailand. 

The Red Junglefowl (RUF; Gallus gallus) is a major wild 
ancestor of the domestic chicken (Darwin, 1875; Liu et 
al., 2006; Miao et al, 2013). Early studies based on 
mitochondrial DNA (mtDNA) revealed that the Thai RJF has 
a close relationship with the domestic chicken (Fumihito et al., 
1994; Fumihito et al., 1996), implying that Thailand is likely 
a domestication center of the chicken. To the best of our 
knowledge, no complete mtDNA genome (i.e., mitogenome) 
sequence of the Thai RJF has been reported. In this study, we 
collected a RJF sample from the Mae Wang District of Chiang 
Mai Province in northern Thailand (permission provided by the 
Thai Institute of Animals for Scientific Purpose Development 
(No. U1-01205-2558)). The complete mitochondrial genome 
was submitted to GenBank (accession No.: MG605671). 

Genomic DNA was extracted from whole blood using 
the HiPure Tissue DNA Micro Kit (Magen, China). The 
PCR amplification, library construction, and next-generation 
sequencing were in accordance with our earlier study (Chen 
et al., 2016). We used a de novo long fragment PCR 
and NGS strategy to obtain high quality mtDNA reads and 
exclude NTMT pseudogenes. We followed caveats for quality 
control in mtDNA genomic studies of domestic animals (Shi 
et al., 2014). The generated sequence was aligned against 
a reference sequence AP003321 (Nishibori et al., 2005), with 
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all variants then output. Using the Integrative Genomics Viewer 
(Thorvaldsdottir et al., 2013), we checked the bam file exported 
by Torrent Suite 5.0.2 to confirm the scored variants. 

Phylogenetic analysis was performed using complete 
mtDNA sequences of all major haplogroups and 
sub-haplogroups, as defined by Miao et al. (2013) and 
Peng et al. (2015). All mitogenomes were aligned by ClustalW, 
then analyzed by maximum parsimony (MP) in MEGA 7.0 with 
1 000 bootstrap replicates (Tamura et al., 2011). 

The complete mitogenome sequence of the Thai RUF (16 785 
bp; GSA accession No.: PRJCA000287, GenBank accession 
No.: MG605671) had an overall base composition of 30.3% 
for A, 23.7% for T, 32.5% for C, and 13.5% for G, with high 
a A+T content of 54.0%. The mitogenome consisted of 13 
protein-coding genes, 22 tRNA genes, two rRNA genes, and 
a displacement loop (D-loop). Most mitogenome genes were 
encoded on the heavy strand, except for eight tRNA genes and 
one protein-coding gene (ND6), which were encoded on the 
light strand. Some protein-coding genes shared the start and 
stop codons; for instance, all 13 genes began with ATG, except 
for COX1, which started with GTG, and of the remaining 12 
protein-coding genes, nine (ND1, COX2, ATPase8, ATPase6, 
ND3, ND4L, ND5, cyt b, and ND6) shared the stop codon TAA, 
two (COX3 and ND4) shard the stop codon “T— —', and ND2 
used TAG. The lengths of the 12S rRNA and 16S rRNA genes 
were 976 bp and 1 622 bp, respectively. 

As the RJF is a wild type of Gallus gallus, it differed 
from all major domestic haplogroups (A to G). We calculated 
the differences between the Thai RJF and randomly selected 
individuals with haplogroups A to G (Table 1). Compared with 
the common domestic haplogroups, the RJF had 40 different 
base pairs on average (range: 29-53). 
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Table 1 Mitochondrial genome differences vs. Thai Red replications using the MEGA 7 software package (Tamura et 
Junglefowl (PRJA000287) al., 2013). The tree showed that the Thai RUF mitogenome 
clustered with GU261692 from Yunnan (Miao et al., 2013) into 


Haplogroup Sequence Different base number (n) haplogroup X, as defined by Miao et al. (2013) and Peng et al. 








A GU261684 48 (2015). To the best of our knowledge, all domestic chickens are 
B GU261714 45 distributed in haplogroups A to G. However, the arrangement of 
C1 GU261679 53 the RJF was identical to haplogroup X, which is only found in 
D GU261683 42 wild chickens. In the MP tree, all haplogroups had very high 
bootstrap values, all larger than 90%. The root of the MP tree 
E GU261712 29 : ! : 

was in haplogroup Y, according to the Chicken Reference Tree 
F QU261717 M in dometree.org (Peng et al., 2015). Thus, our study strongly 

G GU261676 31 supported the previously defined reference tree. 
Here, we assembled the first complete mitogenome of the 
Thai RJF. This study will provide useful information for future 
A phylogenetic tree (Figure 1) of all known Gallus gallus evolutionary and population genetics analyses of the RJF and 

mitogenomes was constructed and tested with 1 000 bootstrap domestic chicken. 
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Figure 1 Phylogenetic tree of 41 complete mitochondrial genomes of Gallus gallus constructed with maximum parsimony 
Black dot represents the new sample in this study. Bootstrap support values (>50) are shown at the nodes. 
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